entirety.

FIELD OF THE INVENTION 

The present invention relates to small molecules, peptides, peptide analogues, 
proteins, and methods for using such compositions to regulate signalling pathways in cells. In 
one aspect, the invention provides methods of modulating localization or function of receptors 
that bind heterotrimeric G proteins by antagonizing or promoting binding between a PDZ
domain-containing protein and a protein that binds a PDZ domain.

BACKGROUND 

G-protein-coupled receptors (GPCRs) constitute the largest family of cell
surface molecules involved in signal transmission. These receptors play key physiological
roles, and their dysfunction results in diseases or disorders such as immune and cardiovascular
disorders including asthma and inflammation, neurological disorders affecting anxiety,
memory, depression, and cognition, endocrine disorders and more. Their importance in
physiological systems has made them one of the most frequently targeted classes of proteins
for drug discovery.

Estimates of the number of different GPCRs in the human genome range from
300 to over 1000. This makes drug discovery a complex process requiring significant trial and
error in the identification of compounds that will inhibit a single GPCR. In an effort to reduce
the complexity of this effort, many groups are attempting to develop drugs that target
interactions between hetero- or homo-dimerized GPCRs, regulatory features such as
phosphorylation sites, or the binding of specific G proteins to GPCRs. There still exists a
great need to be able to effectively identify therapeutics that target this class of proteins.
One class of GPCRs in need of additional therapeutic inhibitors is the alpha
adrenergic receptors. Six alpha adrenergic receptors have been identified at this time: alpha
1A, 1B and 1C and alpha 2A, 2B and 2C. Alpha 1 receptors have been shown to mediate
actions in the sympathetic nervous system through binding of hormones such as the
catecholamines epinephrine and norepinephrine. Alpha 2 receptors have been shown to play
roles in regulating neurotransmitter release from sympathetic and adrenergic neurons in the
central nervous system. The tissue distributions differ between members of each group of
receptors, arguing a need for type-specific or sub-type-specific therapeutics. Specific
antagonists and agonists of certain alpha adrenergic receptors (αARs) have been identified, but
the pharmacokinetic profiles of certain agents targeting alpha 1 adrenergic receptors (α1ARs)
demonstrate that they penetrate the blood-brain barrier, potentially giving rise to adverse side
effects (Pool JL. Int. Urol Nephrol 2001, 33(3):407). However, several indications merit
therapeutic targeting of brain functions, so the need for blood-brain barrier penetrance will be
receptor type and disease specific. Alpha 1 receptors have been experimentally implicated in
depression, lower urinary tract storage, migraines, prostate apoptosis, and
hypertrophy/proliferation/migration of vascular smooth muscle following carotid balloon
injury. Alpha 2 receptors have been experimentally implicated in migraine, glucose
metabolism, coronary flow reserve after stenting, Alzheimer's, Parkinson's, neuroprotection,
glaucoma, and opioid withdrawal management. We have demonstrated binding between alpha
adrenergic receptors and PDZ proteins, thus allowing a novel set of targets to treat the disorders
listed above.

We have identified that PDZ proteins can organize and regulate the expression
and function of a subset of GPCRs. PDZ domain-containing proteins have since been shown to
regulate a myriad of cellular functions, including vesicular trafficking, tumor suppression,
protein sorting, establishment of membrane polarity, and apoptosis. A common function of this
family is to facilitate the assembly of multi-protein complexes, often serving as a bridge between
several proteins. By possessing multiple PDZ domains, many PDZ-containing proteins act as
organizers within the cell by increasing the local concentration of one or more proteins, and by
regulating the localization of the clusters through interactions with the cytoskeleton or other
organelles. One such protein, EBP50, has been shown to be an essential link between the
β2-adrenergic receptor and the actin cytoskeleton, regulating the receptor's proper endocytosis
and recycling to the plasma membrane. Another protein containing multiple PDZ domains,
PDZK1, is essential for regulating ion conductance and polarized membrane distribution of the
cystic fibrosis chloride channel. Others contain intrinsic enzymatic activity, and use their PDZ
domains to localize the enzyme with its appropriate substrates. Thus, PDZ domains represent
an important means by which the cell regulates the organization, localization, and function of
proteins. The function of PDZ domains in certain biological systems is described, for example,
in published PCT applications that are commonly owned by the assignee of the instant
application (see, e.g., WO 00/13161, WO 00/69898 and WO 00/69897, each of which is
incorporated herein by reference in its entirety for all purposes). PDZ interactions with their
ligands have been shown to be amenable to therapeutic intervention (Aarts et al., Science 2002,
298:846), thus underscoring the therapeutic potential of these interactions.

The following publications are of interest: Stone (2003)
Neuropsychopharmacology 28(8):1387-99; Djavan (2003) Urology 62:6-14; Willems (2003)
Cephalalgia 23(4):245-57; Anglin (2002) Prostate Cancer Prostatic Dis. 5(2):88-95; Pool
(2001) Int Urol Nephrol 33(3):407-12; Roehrbora (2002) 59:3-6; Velliquette (2003) J
Pharmacol Exp Ther 306(2):646-57; Stewart (2002) Circulation 106(23):2946-54; Gregorini
(2002) Circulation 106(23):2901-7; Debeir (2002) Neurosci 115(1):41-53; Teeters (2003) Am J
Physiol Heart Circ Physiol 284(1):H385-92; Savola (2003) Mov Disord 18(8):872-83; Wheeler
(2003) Surv Ophthalmol 48 Suppl 1:S47-51; Tatton (2003) Surv Ophthalmol 48 Suppl 1:S25-37;
Gowing (2003) Cochrane Database Syst Rev (2):CD002024; Pupo (2002) BMC Pharmacology
2:17-33.

In addition, the following patents and patent applications are of interest: Soppet, 
5,994,506; Pausch 6,406,871. 

SUMMARY

Methods and compositions for modulating biological function in a variety of
cell types (e.g., hematopoietic, neuronal, brain, stem, epidermal and epithelial) are provided
herein. These methods and compositions can be utilized to treat various maladies such as
immune disorders, nervous system disorders and muscle disorders, for example. More
specifically, these methods and compositions are for modulating binding between certain
PDZ proteins and PL protein binding pairs as shown in Table 8.

Certain methods involve introducing into the cell an agent that alters binding
between a PDZ protein and a PL protein in the cell, whereby the biological function is
modulated in the cell, and wherein the PDZ protein and PL protein are a binding pair as
specified in Table 2. In some of these methods, the agent is a polypeptide comprising at
least the two or three carboxy-terminal residues of the PL protein.

Screening methods to identify compounds that modulate binding between
PDZ proteins and PL peptides or proteins are also provided. Some screening methods
involve contacting under suitable binding conditions (i) a PDZ-domain polypeptide having
a sequence from a PDZ protein, and (ii) a PL peptide, wherein the PL peptide comprises a
C-terminal sequence of the PL protein, the PDZ-domain polypeptide and the PL peptide
are a binding pair as specified in Table 2, and the contacting is performed in the presence of
the test compound. Presence or absence of complex is then detected. The presence of the
complex at a level that is statistically significantly higher in the presence of the test
compound than in the absence of test compound is an indication that the test compound is
an agonist, whereas the presence of the complex at a level that is statistically significantly
lower in the presence of the test compound than in the absence of test compound is an
indication that the test compound is an antagonist.
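The comparison described above can be sketched in code. The following Python sketch is illustrative only: the function names, the replicate readings, the 0.05 significance threshold, and the use of an exact permutation test are all assumptions made for this example. The text requires only that the difference in complex level be statistically significant and does not specify a test.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(with_compound, without_compound):
    """Exact two-sided permutation test on the difference of group means.

    Assumption for illustration: the text does not name a statistical test.
    """
    pooled = list(with_compound) + list(without_compound)
    n_with = len(with_compound)
    observed = abs(mean(with_compound) - mean(without_compound))
    indices = range(len(pooled))
    total = 0
    at_least_as_extreme = 0
    # Enumerate every way of relabelling the pooled readings into two groups.
    for chosen in combinations(indices, n_with):
        chosen_set = set(chosen)
        group_a = [pooled[i] for i in indices if i in chosen_set]
        group_b = [pooled[i] for i in indices if i not in chosen_set]
        total += 1
        # Small tolerance guards against floating-point rounding.
        if abs(mean(group_a) - mean(group_b)) >= observed - 1e-12:
            at_least_as_extreme += 1
    return at_least_as_extreme / total


def classify_test_compound(with_compound, without_compound, alpha=0.05):
    """Label a test compound from complex-level readings (e.g., assay signal).

    alpha is a hypothetical threshold chosen for this sketch.
    """
    p_value = permutation_p_value(with_compound, without_compound)
    if p_value >= alpha:
        return "no significant effect"
    if mean(with_compound) > mean(without_compound):
        return "agonist"      # significantly more complex with the compound
    return "antagonist"       # significantly less complex with the compound


# Hypothetical replicate readings of PDZ/PL complex formation.
label = classify_test_compound([0.20, 0.22, 0.19, 0.21], [0.80, 0.85, 0.78, 0.82])
```

With four replicates per condition the smallest achievable two-sided p-value in this test is 2/70, so fewer replicates could not reach the 0.05 threshold assumed here.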

Modulators of binding between a PDZ protein and a PL protein are also
described herein. In certain instances, the modulator is (a) a peptide comprising at least 3
residues of a C-terminal sequence of a PL protein, wherein the PDZ protein and the PL
protein are a binding pair as specified in Table 2; or (b) a peptide mimetic of the peptide of
section (a); or (c) a small molecule having similar functional activity with respect to the
PDZ and PL protein binding pair as the peptide of section (a). The modulator can be either
an agonist or antagonist. Such modulators can be formulated as a pharmaceutical
composition.

Methods of treating a disease correlated with binding between a PDZ protein
and a PL protein are also disclosed herein, the method comprising administering a
therapeutically effective amount of a modulator as provided herein, wherein the PDZ
protein and the PL protein are a binding pair as specified in Table 2. As indicated supra,
such methods can be used to treat a variety of diseases such as a neurological disease, an
immune response disease, a muscular disease, or a cancer.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGURE 1 shows the interaction between Interleukin 8 receptor A (IL8RA) and 
the PDZ proteins MAGI1 (domain 2 of 6), TIP1 (domain 1 of 1) and MINT2 (domains 1 & 2) 
in the in vitro "G" Assay. For each of the three PDZ proteins, the OD (A450) of the interaction 
with IL8RA is shown in dark gray. The negative control for each of these three reactions is the
interaction of GST with IL8RA peptide, the results of which are shown in light gray.

DESCRIPTION 

I. Definitions 

A "fusion protein" or "fusion polypeptide" as used herein refers to a composite
protein, i.e., a single contiguous amino acid sequence, made up of two (or more) distinct,
heterologous polypeptides which are not normally fused together in a single amino acid
sequence. Thus, a fusion protein can include a single amino acid sequence that contains two
entirely distinct amino acid sequences or two similar or identical polypeptide sequences,
provided that these sequences are not normally found together in the same configuration in a
single amino acid sequence found in nature. Fusion proteins can generally be prepared using
either recombinant nucleic acid methods, i.e., as a result of transcription and translation of a
recombinant gene fusion product, which fusion comprises a segment encoding a polypeptide of
the invention and a segment encoding a heterologous protein, or by chemical synthesis methods
well known in the art.

A "fusion protein construct" as used herein is a polynucleotide encoding a fusion
protein.

As used herein, the term "PDZ domain" refers to a protein sequence (i.e., modular
protein domain) of approximately 90 amino acids, characterized by homology to the brain
synaptic protein PSD-95, the Drosophila septate junction protein Discs-Large (DLG), and the
epithelial tight junction protein ZO1 (ZO1). PDZ domains are also known as Discs-Large
homology repeats ("DHRs") and GLGF repeats. PDZ domains generally appear to maintain a
core consensus sequence (Doyle, D. A., 1996, Cell 85:1067-76).

PDZ domains are found in diverse membrane-associated proteins including 
members of the MAGUK family of guanylate kinase homologs, several protein phosphatases 
and kinases, neuronal nitric oxide synthase, and several dystrophin-associated proteins, 
collectively known as syntrophins.

Exemplary PDZ domain-containing proteins and PDZ domain sequences are
shown in TABLE 6. The term "PDZ domain" also encompasses variants (e.g., naturally
occurring variants) of the sequences of TABLE 6 (e.g., polymorphic variants, variants with
conservative substitutions, and the like). Typically, PDZ domains are substantially identical to
those shown in TABLE 6, e.g., at least about 70%, at least about 80%, or at least about 90%
amino acid residue identity when compared and aligned for maximum correspondence.
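The identity thresholds above can be illustrated with a short calculation. The Python sketch below is a minimal example under stated assumptions: the two sequences are already aligned (gaps written as "-"), identity is counted over every column in which at least one sequence has a residue, and the toy sequences are hypothetical rather than taken from TABLE 6. Other conventions for the denominator exist, and the text does not prescribe one.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent amino acid identity between two pre-aligned sequences.

    Assumptions for illustration: equal-length alignment strings, '-' for
    gaps, and columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    compared = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == "-" and res_b == "-":
            continue  # column carries no residue in either sequence
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no residues to compare")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a, aligned_b, threshold=70.0):
    """True if identity is at least the threshold (70, 80 or 90 in the text)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 10-column alignment with one gap and one substitution.
identity = percent_identity("ACDEFGHIKL", "ACDE-GHIKV")
```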

As used herein, the term "PDZ protein" refers to a naturally occurring protein
containing a PDZ domain. Exemplary PDZ proteins include CASK, MPP1, DLG1, PSD95,
NeDLG, TIP33, SYN1a, TIP43, LDP, LIM, LIMK1, LIMK2, MPP2, NOS1, AF6, PTN-4,
prIL16, 41.8kD, KIAA0559, RGS12, KIAA0316, DVL1, TIP40, TIAM1, MI^
CBP, MINT3, TIP2, KIAA0561, and those listed in TABLE 6.

As used herein, the terms "PDZ-domain polypeptide" and "PDZ polypeptide"
refer to a polypeptide containing a PDZ domain, such as a fusion protein including a PDZ
domain sequence, a naturally occurring PDZ protein, or an isolated PDZ domain peptide.

As used herein, the term "G-protein coupled receptor," or "GPCR," refers to a
naturally occurring polypeptide receptor, or a polynucleotide encoding a receptor, known to
interact with G proteins or to have homology to proteins known to interact with G proteins. In
addition, this definition includes polypeptide receptors, or polynucleotides encoding receptors,
that are similar to those known to interact with G proteins. A partial list of known GPCRs is
presented in TABLE 3.


As used herein, the term "PL protein" or "PDZ Ligand protein" refers to a
naturally occurring protein that forms a molecular complex with a PDZ domain, or to a protein
whose carboxy-terminus, when expressed separately from the full length protein (e.g., as a
peptide fragment of 4-25 residues, e.g., 8, 10, 12, 14 or 16 residues), forms such a molecular
complex. The molecular complex can be observed in vitro using the "A assay" or "G assay"
described infra, or in vivo. Exemplary PL proteins listed in TABLE 2 are demonstrated to bind
specific PDZ proteins. This definition is not intended to include anti-PDZ antibodies and the
like.

As used herein, "GPCR-PL" refers to a PDZ ligand sequence that occurs within a
GPCR polypeptide sequence.

As used herein, a "PL sequence" refers to the amino acid sequence of the
C-terminus of a PL protein (e.g., the C-terminal 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 20 or 25
residues) ("C-terminal PL sequence") or to an internal sequence known to bind a PDZ domain
("internal PL sequence").
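As a simple illustration of a C-terminal PL sequence, the Python sketch below takes the last N residues of a protein sequence written in one-letter code. The function names and the example sequence are hypothetical and are not drawn from TABLE 2. The default lengths reproduce the exemplary residue counts listed in the definition above.

```python
# Exemplary C-terminal lengths enumerated in the definition of "PL sequence".
EXEMPLARY_LENGTHS = (2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 20, 25)


def c_terminal_pl_sequence(protein_sequence, length):
    """Return the C-terminal `length` residues of a one-letter-code sequence."""
    sequence = protein_sequence.strip().upper()
    if length < 1:
        raise ValueError("length must be at least 1")
    if length > len(sequence):
        raise ValueError("length exceeds the protein sequence length")
    return sequence[-length:]


def candidate_pl_sequences(protein_sequence, lengths=EXEMPLARY_LENGTHS):
    """Map each feasible length to the corresponding C-terminal sequence."""
    sequence = protein_sequence.strip().upper()
    return {
        n: c_terminal_pl_sequence(sequence, n)
        for n in lengths
        if n <= len(sequence)
    }


# Hypothetical 14-residue sequence used only to show the slicing.
candidates = candidate_pl_sequences("MKTAYIAKQRETSV")
```

Internal PL sequences are not covered by this sketch, since the text identifies them by known binding rather than by position.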

As used herein, a "PL peptide" is a peptide having a sequence from, or based
on, the sequence of the C-terminus of a PL protein. Exemplary PL peptides (biotinylated) are
listed in TABLE 2.

As used herein, a "PL fusion protein" is a fusion protein that has a PL sequence
as one domain, typically as the C-terminal domain of the fusion protein. An exemplary PL
fusion protein is a tat-PL sequence fusion.

As used herein, the term "PL inhibitor peptide sequence" refers to a PL peptide
amino acid sequence that (in the form of a peptide or PL fusion protein) inhibits the interaction
between a PDZ domain polypeptide and a PL peptide (e.g., in an A assay or a G assay).


As used herein, a "PDZ-domain encoding sequence" means a segment of a 
polynucleotide encoding a PDZ domain. In various embodiments, the polynucleotide is DNA, 
RNA, single stranded or double stranded. 

As used herein, the terms "antagonist" and "inhibitor," when used in the context
of modulating a binding interaction (such as the binding of a PDZ domain sequence to a PL
sequence), are used interchangeably and refer to an agent that reduces the binding of the, e.g.,
PL sequence (e.g., PL peptide) and the, e.g., PDZ domain sequence (e.g., PDZ protein, PDZ
domain peptide).


As used herein, the terms "agonist" and "enhancer," when used in the context of
modulating a binding interaction (such as the binding of a PDZ domain sequence to a PL
sequence), are used interchangeably and refer to an agent that increases the binding of the, e.g.,
PL sequence (e.g., PL peptide) and the, e.g., PDZ domain sequence (e.g., PDZ protein, PDZ
domain peptide).

As used herein, the terms "peptide mimetic," "peptidomimetic," and "peptide
analog" are used interchangeably and refer to a synthetic chemical compound which has
substantially the same structural and/or functional characteristics of a PL inhibitory or PL
binding peptide of the invention. The mimetic can be either entirely composed of synthetic,
non-natural analogues of amino acids, or a chimeric molecule of partly natural peptide
amino acids and partly non-natural analogs of amino acids. The mimetic can also incorporate
any amount of natural amino acid conservative substitutions as long as such substitutions also
do not substantially alter the mimetic's structure and/or inhibitory or binding activity. As with
polypeptides of the invention which are conservative variants, routine experimentation will
determine whether a mimetic is within the scope of the invention, i.e., that its structure and/or
function is not substantially altered. Thus, a mimetic composition is within the scope of the
invention if it is capable of binding to a PDZ domain and/or inhibiting a PL-PDZ interaction.

Polypeptide mimetic compositions can contain any combination of nonnatural
structural components, which are typically from three structural groups: a) residue linkage
groups other than the natural amide bond ("peptide bond") linkages; b) non-natural residues in
place of naturally occurring amino acid residues; or c) residues which induce secondary
structural mimicry, i.e., to induce or stabilize a secondary structure, e.g., a beta turn, gamma
turn, beta sheet, alpha helix conformation, and the like.

A polypeptide can be characterized as a mimetic when all or some of its residues
are joined by chemical means other than natural peptide bonds. Individual peptidomimetic
residues can be joined by peptide bonds, other chemical bonds or coupling means, such as, e.g.,
glutaraldehyde, N-hydroxysuccinimide esters, bifunctional maleimides,
N,N'-dicyclohexylcarbodiimide (DCC) or N,N'-diisopropylcarbodiimide (DIC). Linking groups
that can be an alternative to the traditional amide bond ("peptide bond") linkages include, e.g.,
ketomethylene (e.g., -C(=O)-CH2- for -C(=O)-NH-), aminomethylene (CH2-NH), ethylene,
olefin (CH=CH), ether (CH2-O), thioether (CH2-S), tetrazole (CN4-), thiazole, retroamide,
thioamide, or ester (see, e.g., Spatola (1983) in Chemistry and Biochemistry of Amino Acids,
Peptides and Proteins, Vol. 7, pp 267-357, "Peptide Backbone Modifications," Marcel Dekker,
NY).

A polypeptide can also be characterized as a mimetic by containing all or some
non-natural residues in place of naturally occurring amino acid residues. Nonnatural residues
are well described in the scientific and patent literature; a few exemplary nonnatural
compositions useful as mimetics of natural amino acid residues and guidelines are described
below.

Mimetics of aromatic amino acids can be generated by replacement with, e.g., D- or
L-naphthylalanine; D- or L-phenylglycine; D- or L-2-thienylalanine; D- or L-1-, 2-, 3-, or
4-pyrenylalanine; D- or L-3-thienylalanine; D- or L-(2-pyridinyl)-alanine; D- or
L-(3-pyridinyl)-alanine; D- or L-(2-pyrazinyl)-alanine; D- or L-(4-isopropyl)-phenylglycine;
D-(trifluoromethyl)-phenylglycine; D-(trifluoromethyl)-phenylalanine; D-p-fluorophenylalanine;
D- or L-p-biphenylphenylalanine; D- or L-p-methoxybiphenylphenylalanine; D- or
L-2-indole(alkyl)alanines; and D- or L-alkylalanines, where alkyl can be substituted or
unsubstituted methyl, ethyl, propyl, hexyl, butyl, pentyl, isopropyl, iso-butyl, sec-isotyl,
iso-pentyl, or a non-acidic amino acid. Aromatic rings of a nonnatural amino acid include, e.g.,
thiazolyl, thiophenyl, pyrazolyl, benzimidazolyl, naphthyl, furanyl, pyrrolyl, and pyridyl
aromatic rings.

Mimetics of acidic amino acids can be generated by substitution with, e.g.,
non-carboxylate amino acids that maintain a negative charge; (phosphono)alanine; or sulfated
threonine. Carboxyl side groups (e.g., aspartyl or glutamyl) can also be selectively modified by
reaction with carbodiimides (R'-N-C-N-R') such as, e.g., 1-cyclohexyl-3(2-morpholinyl-(4-
ethyl) carbodiimide or 1-ethyl-3(4-azonia-4,4-dimethylpentyl) carbodiimide. Aspartyl or
glutamyl can also be converted to asparaginyl and glutaminyl residues by reaction with
ammonium ions.

Mimetics of basic amino acids can be generated by substitution with, e.g., (in
addition to lysine and arginine) the amino acids ornithine, citrulline, or (guanidino)-acetic acid,
or (guanidino)alkyl-acetic acid, where alkyl is defined above. Nitrile derivatives (e.g.,
containing the CN-moiety in place of COOH) can be substituted for asparagine or glutamine.
Asparaginyl and glutaminyl residues can be deaminated to the corresponding aspartyl or
glutamyl residues.

Arginine residue mimetics can be generated by reacting arginyl with, e.g., one or 
more conventional reagents, including, e.g., phenylglyoxal, 2,3-butanedione, 1,2- 
cyclohexanedione, or ninhydrin, preferably under alkaline conditions. 

Tyrosine residue mimetics can be generated by reacting tyrosyl with, e.g.,
aromatic diazonium compounds or tetranitromethane. N-acetylimidazole and tetranitromethane
can be used to form O-acetyl tyrosyl species and 3-nitro derivatives, respectively.

Cysteine residue mimetics can be generated by reacting cysteinyl residues with,
e.g., alpha-haloacetates such as 2-chloroacetic acid or chloroacetamide and corresponding
amines, to give carboxymethyl or carboxyamidomethyl derivatives. Cysteine residue mimetics
can also be generated by reacting cysteinyl residues with, e.g., bromo-trifluoroacetone,
alpha-bromo-beta-(5-imidazoyl) propionic acid; chloroacetyl phosphate, N-alkylmaleimides,
3-nitro-2-pyridyl disulfide; methyl 2-pyridyl disulfide; p-chloromercuribenzoate;
2-chloromercuri-4-nitrophenol; or chloro-7-nitrobenzo-oxa-1,3-diazole.

Lysine mimetics can be generated (and amino terminal residues can be altered)
by reacting lysinyl with, e.g., succinic or other carboxylic acid anhydrides. Lysine and other
alpha-amino-containing residue mimetics can also be generated by reaction with imidoesters,
such as methyl picolinimidate, pyridoxal phosphate, pyridoxal, chloroborohydride,
trinitrobenzenesulfonic acid, O-methylisourea, 2,4-pentanedione, and transamidase-catalyzed
reactions with glyoxylate.

Mimetics of methionine can be generated by reaction with, e.g., methionine
sulfoxide. Mimetics of proline include, e.g., pipecolic acid, thiazolidine carboxylic acid, 3- or
4-hydroxyproline, dehydroproline, 3- or 4-methylproline, or 3,3-dimethylproline. Histidine
residue mimetics can be generated by reacting histidyl with, e.g., diethylpyrocarbonate or
para-bromophenacyl bromide.

Other mimetics include, e.g., those generated by hydroxylation of proline and
lysine; phosphorylation of the hydroxyl groups of seryl or threonyl residues; methylation of the
alpha-amino groups of lysine, arginine and histidine; acetylation of the N-terminal amine;
methylation of main chain amide residues or substitution with N-methyl amino acids; or
amidation of C-terminal carboxyl groups.

A component of a natural polypeptide (e.g., a PL polypeptide or PDZ
polypeptide) can also be replaced by an amino acid (or peptidomimetic residue) of the opposite
chirality. Thus, any amino acid naturally occurring in the L-configuration (which can also be
referred to as the R or S, depending upon the structure of the chemical entity) can be replaced
with the amino acid of the same chemical structural type or a peptidomimetic, but of the
opposite chirality, generally referred to as the D-amino acid, but which can additionally be
referred to as the R- or S- form.

The mimetics of the invention can also include compositions that contain a
structural mimetic residue, particularly a residue that induces or mimics secondary structures,
such as a beta turn, beta sheet, alpha helix structures, gamma turns, and the like. For example,
substitution of natural amino acid residues with D-amino acids; N-alpha-methyl amino acids;
C-alpha-methyl amino acids; or dehydroamino acids within a peptide can induce or stabilize
beta turns, gamma turns, beta sheets or alpha helix conformations. Beta turn mimetic structures
have been described, e.g., by Nagai (1985) Tet. Lett. 26:647-650; Feigl (1986) J. Amer. Chem.
Soc. 108:181-182; Kahn (1988) J. Amer. Chem. Soc. 110:1638-1639; Kemp (1988) Tet. Lett.
29:5057-5060; Kahn (1988) J. Molec. Recognition 1:75-79. Beta sheet mimetic structures have
been described, e.g., by Smith (1992) J. Amer. Chem. Soc. 114:10672-10674. For example, a
type VI beta turn induced by a cis amide surrogate, 1,5-disubstituted tetrazole, is described by
Beusen (1995) Biopolymers 36:181-200. Incorporation of achiral omega-amino acid residues
to generate polymethylene units as a substitution for amide bonds is described by Banerjee
(1996) Biopolymers 39:769-777. Secondary structures of polypeptides can be analyzed by,
e.g., high-field 1H NMR or 2D NMR spectroscopy, see, e.g., Higgins (1997) J. Pept. Res.
50:421-435. See also Hruby (1997) Biopolymers 43:219-266; Balaji, et al., U.S. Pat. No.
5,612,895.

As used herein, "peptide variants" and "conservative amino acid substitutions"
refer to peptides that differ from a reference peptide (e.g., a peptide having the sequence of the
carboxy-terminus of a specified PL protein) by substitution of an amino acid residue having
similar properties (based on size, polarity, hydrophobicity, and the like). Thus, insofar as the
compounds that are encompassed within the scope of the invention are partially defined in
terms of amino acid residues of designated classes, the amino acids may be generally
categorized into three main classes: hydrophilic amino acids, hydrophobic amino acids and
cysteine-like amino acids, depending primarily on the characteristics of the amino acid side
chain. These main classes may be further divided into subclasses. Hydrophilic amino acids
include amino acids having acidic, basic or polar side chains and hydrophobic amino acids
include amino acids having aromatic or apolar side chains. Apolar amino acids may be further
subdivided to include, among others, aliphatic amino acids. The definitions of the classes of
amino acids as used herein are as follows:

"Hydrophobic Amino Acid" refers to an amino acid having a side chain that is
uncharged at physiological pH and that is repelled by aqueous solution. Examples of
genetically encoded hydrophobic amino acids include Ile, Leu and Val. Examples of
non-genetically encoded hydrophobic amino acids include t-BuA.

"Aromatic Amino Acid" refers to a hydrophobic amino acid having a side chain
containing at least one ring having a conjugated π-electron system (aromatic group). The
aromatic group may be further substituted with groups such as alkyl, alkenyl, alkynyl,
hydroxyl, sulfanyl, nitro and amino groups, as well as others. Examples of genetically encoded
aromatic amino acids include Phe, Tyr and Trp. Commonly encountered non-genetically
encoded aromatic amino acids include phenylglycine, 2-naphthylalanine, β-2-thienylalanine,
1,2,3,4-tetrahydroisoquinoline-3-carboxylic acid, 4-chlorophenylalanine,
2-fluorophenylalanine, 3-fluorophenylalanine and 4-fluorophenylalanine.

"Apolar Amino Acid" refers to a hydrophobic amino acid having a side chain
that is generally uncharged at physiological pH and that is not polar. Examples of genetically
encoded apolar amino acids include Gly, Pro and Met. Examples of non-encoded apolar amino
acids include Cha.

"Aliphatic Amino Acid" refers to an apolar amino acid having a saturated or
unsaturated straight chain, branched or cyclic hydrocarbon side chain. Examples of genetically
encoded aliphatic amino acids include Ala, Leu, Val and Ile. Examples of non-encoded
aliphatic amino acids include Nle.

"Hydrophilic Amino Acid" refers to an amino acid having a side chain that is
attracted by aqueous solution. Examples of genetically encoded hydrophilic amino acids
include Ser and Lys. Examples of non-encoded hydrophilic amino acids include Cit and hCys.

"Acidic Amino Acid" refers to a hydrophilic amino acid having a side chain pK
value of less than 7. Acidic amino acids typically have negatively charged side chains at
physiological pH due to loss of a hydrogen ion. Examples of genetically encoded acidic amino
acids include Asp and Glu.

"Basic Amino Acid" refers to a hydrophilic amino acid having a side chain pK
value of greater than 7. Basic amino acids typically have positively charged side chains at
physiological pH due to association with hydronium ion. Examples of genetically encoded
basic amino acids include Arg, Lys and His. Examples of non-genetically encoded basic amino
acids include the non-cyclic amino acids ornithine, 2,3-diaminopropionic acid,
2,4-diaminobutyric acid and homoarginine.

"Polar Amino Acid" refers to a hydrophilic amino acid having a side chain that
is uncharged at physiological pH, but which has a bond in which the pair of electrons shared in
common by two atoms is held more closely by one of the atoms. Examples of genetically
encoded polar amino acids include Asx and Glx. Examples of non-genetically encoded polar
amino acids include citrulline, N-acetyl lysine and methionine sulfoxide.

"Cysteine-Like Amino Acid" refers to an amino acid having a side chain capable
of forming a covalent linkage with a side chain of another amino acid residue, such as a
disulfide linkage. Typically, cysteine-like amino acids have a side chain containing
at least one thiol (SH) group. Examples of genetically encoded cysteine-like amino acids
include Cys. Examples of non-genetically encoded cysteine-like amino acids include
homocysteine and penicillamine.

As will be appreciated by those having skill in the art, the above classifications
are not absolute; several amino acids exhibit more than one characteristic property, and can
therefore be included in more than one category. For example, tyrosine has both an aromatic
ring and a polar hydroxyl group. Thus, tyrosine has dual properties and can be included in both
the aromatic and polar categories. Similarly, in addition to being able to form disulfide
linkages, cysteine also has apolar character. Thus, while not strictly classified as a hydrophobic
or apolar amino acid, in many instances cysteine can be used to confer hydrophobicity to a
peptide.

Certain commonly encountered amino acids which are not genetically encoded
of which the peptides and peptide analogues of the invention may be composed include, but are
not limited to, β-alanine (b-Ala) and other omega-amino acids such as 3-aminopropionic acid
(Dap), 2,3-diaminopropionic acid (Dpr), 4-aminobutyric acid and so forth; α-aminoisobutyric
acid (Aib); ε-aminohexanoic acid (Aha); 5-aminovaleric acid (Ava); N-methylglycine or
sarcosine (MeGly); ornithine (Orn); citrulline (Cit); t-butylalanine (t-BuA); t-butylglycine
(t-BuG); N-methylisoleucine (MeIle); phenylglycine (Phg); cyclohexylalanine (Cha);
norleucine (Nle); 2-naphthylalanine (2-Nal); 4-chlorophenylalanine (Phe(4-Cl));
2-fluorophenylalanine (Phe(2-F)); 3-fluorophenylalanine (Phe(3-F)); 4-fluorophenylalanine
(Phe(4-F)); penicillamine (Pen); 1,2,3,4-tetrahydroisoquinoline-3-carboxylic acid (Tic);
β-2-thienylalanine (Thi); methionine sulfoxide (MSO); homoarginine (hArg); N-acetyl lysine
(AcLys); 2,3-diaminobutyric acid (Dab); 2,3-diaminobutyric acid (Dbu); p-aminophenylalanine
(Phe(pNH2)); N-methyl valine (MeVal); homocysteine (hCys) and homoserine (hSer). These
amino acids also fall conveniently into the categories defined above.

The classifications of the above-described genetically encoded and non-encoded 
amino acids are summarized in TABLE 1, below. It is to be understood that TABLE 1 is for 
illustrative purposes only and does not purport to be an exhaustive list of amino acid residues 
which may comprise the peptides and peptide analogues described herein. Other amino acid 
residues which are useful for making the peptides and peptide analogues described herein can 
be found, e.g., in Fasman, 1989, CRC Practical Handbook of Biochemistry and Molecular 
Biology, CRC Press, Inc., and the references cited therein. Amino acids not specifically 
mentioned herein can be conveniently classified into the above-described categories on the 
basis of known behavior and/or their characteristic chemical and/or physical properties as 
compared with amino acids specifically identified. 

TABLE 1 

Classification      Genetically Encoded    Genetically Non-Encoded 

Hydrophobic 
  Aromatic          F, Y, W                Phg, Nal, Thi, Tic, Phe(4-Cl), Phe(2-F), 
                                           Phe(3-F), Phe(4-F), Pyridyl Ala, 
                                           Benzothienyl Ala 
  Apolar            M, G, P 
  Aliphatic         A, V, L, I             t-BuA, t-BuG, MeIle, Nle, MeVal, Cha, 
                                           bAla, MeGly, Aib 
Hydrophilic 
  Acidic            D, E 
  Basic             H, K, R                Dpr, Orn, hArg, Phe(p-NH2), DBU, A2BU 
  Polar             Q, N, S, T, Y          Cit, AcLys, MSO, hSer 
Cysteine-Like       C                      Pen, hCys, β-methyl Cys 
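
The category assignments in TABLE 1 can be expressed as a simple lookup. The following is a minimal Python sketch covering only the genetically encoded residues; the dictionary layout and the function name `categories` are illustrative assumptions and are not part of the disclosure. It shows that a residue such as tyrosine is returned under more than one category, consistent with the statement above that the classifications are not absolute.

```python
# Illustrative lookup built from the "Genetically Encoded" column of TABLE 1.
# The data layout and the function name are assumptions made for this sketch.
CLASSES = {
    "aromatic": set("FYW"),
    "apolar": set("MGP"),
    "aliphatic": set("AVLI"),
    "acidic": set("DE"),
    "basic": set("HKR"),
    "polar": set("QNSTY"),
    "cysteine-like": set("C"),
}


def categories(residue: str) -> list[str]:
    """Return every TABLE 1 category that lists the one-letter residue code."""
    code = residue.upper()
    return sorted(name for name, members in CLASSES.items() if code in members)
```

For example, `categories("Y")` returns both the aromatic and the polar category, whereas `categories("K")` returns only the basic category.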



As used herein, a "detectable label" has the ordinary meaning in the art and 
refers to an atom (e.g., radionuclide), molecule (e.g., fluorescein), or complex, that is or can be 
used to detect (e.g., due to a physical or chemical property), indicate the presence of a molecule 
or to enable binding of another molecule to which it is covalently bound or otherwise 
associated. The term "label" also refers to covalently bound or otherwise associated molecules 
(e.g., a biomolecule such as an enzyme) that act on a substrate to produce a detectable atom, 
molecule or complex. Detectable labels suitable for use in the present invention include any 
composition detectable by spectroscopic, photochemical, biochemical, immunochemical, 
electrical, optical or chemical means. Labels useful in the present invention include biotin for 
staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads™), fluorescent 
dyes (e.g., fluorescein, Texas red, rhodamine, green fluorescent protein, enhanced green 
fluorescent protein, and the like), radiolabels (e.g., 3H, 125I, 35S, 14C, or 32P), enzymes (e.g., 
hydrolases, particularly phosphatases such as alkaline phosphatase, esterases and glycosidases, 
or oxidoreductases, particularly peroxidases such as horseradish peroxidase, and others 
commonly used in ELISAs), substrates, cofactors, inhibitors, chemiluminescent groups, 
chromogenic agents, and colorimetric labels such as colloidal gold or colored glass or plastic 
(e.g., polystyrene, polypropylene, latex, etc.) beads. Patents teaching the use of such labels 
include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; 
and 4,366,241. Means of detecting such labels are well known to those of skill in the art. 
Thus, for example, radiolabels and chemiluminescent labels may be detected using 
photographic film or scintillation counters, and fluorescent markers may be detected using a 
photodetector to detect emitted light (e.g., as in fluorescence-activated cell sorting). Enzymatic 
labels are typically detected by providing the enzyme with a substrate and detecting the reaction 
product produced by the action of the enzyme on the substrate, and colorimetric labels are 
detected by simply visualizing the colored label. Thus, a label is any composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical 
means. The label may be coupled directly or indirectly to the desired component of the assay 
according to methods well known in the art. Non-radioactive labels are often attached by 
indirect means. Generally, a ligand molecule (e.g., biotin) is covalently bound to the molecule. 
The ligand then binds to an anti-ligand (e.g., streptavidin) molecule which is either inherently 
detectable or covalently bound to a signal generating system, such as a detectable enzyme, a 
fluorescent compound, or a chemiluminescent compound. A number of ligands and 
anti-ligands can be used. Where a ligand has a natural anti-ligand, for example, biotin, thyroxine, 
and cortisol, it can be used in conjunction with the labeled, naturally occurring anti-ligands. 
Alternatively, any haptenic or antigenic compound can be used in combination with an 
antibody. The molecules can also be conjugated directly to signal generating compounds, e.g., 
by conjugation with an enzyme or fluorophore. Means of detecting labels are well known to 
those of skill in the art. Thus, for example, where the label is a radioactive label, means for 
detection include a scintillation counter, photographic film as in autoradiography, or storage 
phosphor imaging. Where the label is a fluorescent label, it may be detected by exciting the 
fluorochrome with the appropriate wavelength of light and detecting the resulting fluorescence. 
The fluorescence may be detected visually, by means of photographic film, or by the use of 
electronic detectors such as charge coupled devices (CCDs) or photomultipliers and the like. 
Similarly, enzymatic labels may be detected by providing the appropriate substrates for the 
enzyme and detecting the resulting reaction product. Also, simple colorimetric labels may be 
detected by observing the color associated with the label. It will be appreciated that when pairs 
of fluorophores are used in an assay, it is often preferred that they have distinct emission 
patterns (wavelengths) so that they can be easily distinguished. 

As used herein, the term "substantially identical" in the context of comparing 
amino acid sequences, means that the sequences have at least about 70%, at least about 80%, or 
at least about 90% amino acid residue identity when compared and aligned for maximum 
correspondence. An algorithm that is suitable for determining percent sequence identity and 
sequence similarity is the FASTA algorithm, which is described in Pearson, W.R. & Lipman, 
D.J., 1988, Proc. Natl. Acad. Sci. U.S.A. 85: 2444. See also W. R. Pearson, 1996, Methods 
Enzymol. 266: 227-258. Preferred parameters used in a FASTA alignment of DNA sequences 
to calculate percent identity are optimized, BL50 Matrix 15: -5, k-tuple = 2; joining penalty = 
40, optimization = 28; gap penalty = -12, gap length penalty = -2; and width = 16. 

As used herein, "hematopoietic cells" include leukocytes including lymphocytes 
(T cells, B cells and NK cells), monocytes, and granulocytes (i.e., neutrophils, basophils and 
eosinophils), macrophages, dendritic cells, megakaryocytes, reticulocytes, erythrocytes, and 
CD34+ stem cells. 
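
The percent-identity criterion used above to define "substantially identical" sequences can be illustrated with a short Python sketch. This is not the FASTA algorithm: it assumes that two sequences have already been aligned by some external tool and are supplied as equal-length strings with `-` marking gaps. The function names, the choice to skip gap columns when counting, and the default 70% threshold are assumptions made for illustration only.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of identical residues over the gap-free columns of an alignment.

    Assumes the two strings come from an existing pairwise alignment
    (equal length, gaps written as `gap`). Skipping gap columns is one of
    several conventions; other tools use a different denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # column contains a gap; not counted in this sketch
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def is_substantially_identical(aligned_a: str, aligned_b: str,
                               threshold: float = 70.0) -> bool:
    """Apply an identity threshold (70, 80 or 90 in the text above)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Under these assumptions, two aligned 10-residue sequences that differ at a single position have 90% identity and satisfy each of the thresholds recited above.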

As used herein, the terms "test compound" or "test agent" are used 
interchangeably and refer to a candidate agent that may have enhancer/agonist, or 
inhibitor/antagonist activity, e.g., inhibiting or enhancing an interaction such as PDZ-PL 
binding. The candidate agents or test compounds may be any of a large variety of compounds, 
both naturally occurring and synthetic, organic and inorganic, and including polymers (e.g., 
oligopeptides, polypeptides, oligonucleotides, and polynucleotides), small molecules, 
antibodies (as broadly defined herein), sugars, fatty acids, nucleotides and nucleotide analogs, 
analogs of naturally occurring structures (e.g., peptide mimetics, nucleic acid analogs, and the 
like), and numerous other compounds. In certain embodiments, test agents are prepared from 
diversity libraries, such as random or combinatorial peptide or non-peptide libraries. Many 
libraries are known in the art that can be used, e.g., chemically synthesized libraries, 
recombinant libraries (e.g., phage display libraries), and in vitro translation-based libraries. 
Examples of chemically synthesized libraries are described in Fodor et al., 1991, Science 
251:767-773; Houghten et al., 1991, Nature 354:84-86; Lam et al., 1991, Nature 354:82-84; 
Medynski, 1994, Bio/Technology 12:709-710; Gallop et al., 1994, J. Medicinal Chemistry 
37(9):1233-1251; Ohlmeyer et al., 1993, Proc. Natl. Acad. Sci. USA 90:10922-10926; Erb et al., 
1994, Proc. Natl. Acad. Sci. USA 91:11422-11426; Houghten et al., 1992, Biotechniques 13:412; 
Jayawickreme et al., 1994, Proc. Natl. Acad. Sci. USA 91:1614-1618; Salmon et al., 1993, Proc. 
Natl. Acad. Sci. USA 90:11708-11712; PCT Publication No. WO 93/20242; and Brenner and 
Lerner, 1992, Proc. Natl. Acad. Sci. USA 89:5381-5383. Examples of phage display libraries are 
described in Scott and Smith, 1990, Science 249:386-390; Devlin et al., 1990, Science 
249:404-406; Christian, R.B., et al., 1992, J. Mol. Biol. 227:711-718; Lenstra, 1992, J. Immunol. 
Meth. 152:149-157; Kay et al., 1993, Gene 128:59-65; and PCT Publication No. WO 94/18318 
dated August 18, 1994. In vitro translation-based libraries include but are not limited to those 
described in PCT Publication No. WO 91/05058 dated April 18, 1991; and Mattheakis et al., 
1994, Proc. Natl. Acad. Sci. USA 91:9022-9026. By way of examples of nonpeptide libraries, a 
benzodiazepine library (see e.g., Bunin et al., 1994, Proc. Natl. Acad. Sci. USA 91:4708-4712) 
can be adapted for use. Peptoid libraries (Simon et al., 1992, Proc. Natl. Acad. Sci. USA 
89:9367-9371) can also be used. Another example of a library that can be used, in which the 
amide functionalities in peptides have been permethylated to generate a chemically transformed 
combinatorial library, is described by Ostresh et al. (1994, Proc. Natl. Acad. Sci. USA 
91:11138-11142). 

The term "specific binding" refers to binding between two molecules, for 
example, a ligand and a receptor, characterized by the ability of a molecule (ligand) to associate 
with another specific molecule (receptor) even in the presence of many other diverse molecules, 
i.e., to show preferential binding of one molecule for another in a heterogeneous mixture of 
molecules. Specific binding of a ligand to a receptor is also evidenced by reduced binding of a 
detectably labeled ligand to the receptor in the presence of excess unlabeled ligand (i.e., a 
binding competition assay). 
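
The competition criterion described above, in which binding of a labeled ligand is reduced by excess unlabeled ligand, can be quantified in a simple way. The following Python sketch is illustrative only: the function name, the optional background subtraction, and the expression of the result as a percentage are assumptions and are not taken from the disclosure.

```python
def percent_displacement(signal_labeled_only: float,
                         signal_with_competitor: float,
                         background: float = 0.0) -> float:
    """Percent reduction in labeled-ligand signal caused by unlabeled competitor.

    All three readings are assumed to be in the same units (e.g., absorbance
    or counts). A value near 100 suggests the labeled signal is largely
    displaceable; a value near 0 suggests little competition.
    """
    specific = signal_labeled_only - background
    if specific <= 0:
        raise ValueError("labeled-only signal must exceed background")
    remaining = signal_with_competitor - background
    return 100.0 * (1.0 - remaining / specific)
```

For instance, a labeled-only reading of 2.0 that falls to 0.5 in the presence of excess unlabeled ligand corresponds to 75% displacement under these assumptions.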

As used herein, a "plurality" of PDZ proteins (or corresponding PDZ domains or 
PDZ fusion polypeptides) has its usual meaning. In some embodiments, the plurality is at least 
5, and often at least 25, at least 40, or at least 60 different PDZ proteins. In some embodiments, 
the plurality is selected from the list of PDZ polypeptides listed in TABLE 6. In some 
embodiments, the plurality of different PDZ proteins are from (i.e., expressed in) a particular 
specified tissue or a particular class or type of cell. In some embodiments, the plurality of 
different PDZ proteins represents a substantial fraction (e.g., typically at least 50%, more often 
at least 80%) of all of the PDZ proteins known to be, or suspected of being, expressed in the 
tissue or cell(s), e.g., all of the PDZ proteins known to be present in lymphocytes or 
hematopoietic cells. In some embodiments, the plurality is at least 50%, usually at least 80%, at 
least 90% or all of the PDZ proteins disclosed herein as being expressed in a particular cell. 

When referring to PL peptides (or the corresponding proteins, e.g., 
corresponding to those listed in TABLE 3, or elsewhere herein) a "plurality" may refer to at 
least 5, at least 10, and often at least 25 PLs such as those specifically listed herein, or to the 
classes and percentages set forth supra for PDZ domains. 

II. Overview 

The present inventors have identified a large number of interactions between 
PDZ proteins and GPCR proteins containing a PL motif. These interactions can play a 
significant role in the biological function of certain cells and in a variety of physiological 
systems. As used herein, the term "biological function" in the context of a cell, refers to a 
detectable biological activity normally carried out by the cell, e.g., a phenotypic change such as 
proliferation, cell activation (e.g., T cell activation, B cell activation, T-B cell conjugate 
formation), cytokine release, degranulation, tyrosine phosphorylation, ion (e.g., calcium) flux, 
metabolic activity, apoptosis, changes in gene expression, maintenance of cell structure, cell 
migration, adherence to a substrate, signal transduction, cell-cell interactions, and others 
described herein or known in the art. 

Because the interactions involve proteins which are involved in diverse 
physiological systems (see Background section supra), the methods provided herein can be 
utilized to broadly or selectively modulate a number of different biological functions. Methods 
are also disclosed herein for determining whether a test compound acts as a modulator of 
binding between a particular PDZ protein and PL protein binding pair. Both agonists and 
antagonists of the binding pairs can be identified by such screening methods. Modulators so 
identified can subsequently be formulated as a pharmaceutical composition and used in the 
treatment of various diseases that are correlated with binding between a particular PDZ protein 
and PL protein or set of such proteins. 

III. PDZ Protein and PL Protein Interactions 

TABLE 2 lists PDZ proteins and GPCR-PL proteins which the current inventors 
have identified as binding to one another. Each page of TABLE 2 includes six columns. The 
columns are numbered from left to right such that the left-most column is column 1 and the 
right-most column is column 6. Thus, the first column is labeled "AVC PL ID" and lists AA 
numbers that serve as unique internal designations for each PL peptide. These ID numbers 
correspond to those listed in column 6 of TABLE 3. The second column is labeled "PL" and 
lists the various PL proteins/PDZ-Ligands that were examined. This column lists gene 
abbreviations, with full gene names listed in parentheses, for peptides corresponding to the 
carboxyl-terminal 20 amino acids of the protein listed. The third column, labeled "PL 20mer 
Sequence," lists the carboxyl-terminal 20 amino acids of the protein. All ligands are 
biotinylated at the amino-terminus. Some have been modified to eliminate cysteine amino 
acids from the 20mer sequence. Modifications have been noted in column 2, and wildtype 
sequences are presented in TABLE 3. 

The PDZ protein (or proteins) that interact(s) with a PL peptide are listed in the 
fourth column that is labeled "PDZ". This column provides the gene name for the PDZ portion 
of the GST-PDZ fusion that interacts with the PDZ-ligand to the left. For PDZ domain-containing 
proteins with multiple domains, the domain number is listed to the right of the PDZ 
(i.e., in column 5 labeled "PDZ Domain"), and indicates the PDZ domain number when 
numbered from the amino-terminus to the carboxy-terminus. 

The sixth column labeled "Binding Strength" is another measure of the level of 
binding. In particular, it provides an absorbance value at 450 nm which indicates the amount of 
PL peptide bound to the PDZ protein. The following numerical values have the following 
meanings: '1' - A450nm 0-1; '2' - A450nm 1-2; '3' - A450nm 2-3; '4' - A450nm 3-4; '5' - 
A450nm of 4. All interactions have been repeated a total of at least 4 times, and all show 
A450nm values that are at least two times that of controls. 
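
The scoring scheme described above can be sketched as a small binning function. The Python below is illustrative only: the text does not state which score applies to a reading that falls exactly on a boundary (e.g., A450nm of exactly 1), so assigning boundary values to the higher score, and treating any reading of 4 or more as a '5', are assumptions of this sketch. The function names are hypothetical.

```python
def binding_strength(a450: float) -> int:
    """Map an absorbance reading at 450 nm to the 1-5 score used in TABLE 2.

    Assumption: boundary readings go to the higher bin, and readings of 4 or
    more are scored 5.
    """
    if a450 < 0:
        raise ValueError("absorbance cannot be negative")
    if a450 >= 4.0:
        return 5
    return int(a450) + 1


def exceeds_control(a450: float, control_a450: float) -> bool:
    """Check the stated criterion of at least two times the control reading."""
    return a450 >= 2.0 * control_a450
```

Under these assumptions, a reading of 2.7 is scored '3', and a reading of 0.9 against a control of 0.4 satisfies the two-fold criterion.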


Further information regarding these PL proteins and PDZ proteins is provided in 
TABLES 3 and 6. In particular, TABLE 3 provides a partial listing of known G-protein 
coupled receptors, along with the amino acid sequence of the carboxyl-terminal 20 amino 
acids. When numbered from left to right, the first column labeled "Gene Name(Synonyms)" 
provides the most commonly used name for that gene, with synonyms or acronyms listed in 
parentheses. Genbank reference numbers (Accession number and GI number) are listed in 
column 2, labeled "Genbank Reference." Columns 3 and 4, labeled "Last 20 aa" and "Last 4 
aa," respectively, list the last 20 amino acids, and the last 4 amino acids of each protein. 
Column 5, labeled "PL?" marks with an "X" those carboxy-terminal sequences that are 
predicted to display a classic PL amino acid motif. Many of the carboxyl-terminal motifs that 
are not marked in column 5 may exhibit binding to PDZ proteins, and the designation as a 
classic PL motif is in no way intended to predict or restrict GPCR binding patterns to PDZ 
proteins. The sixth column labeled "AVC PL ID" provides the internal designation number 
used to refer to a particular PL protein and correlates with the designation used in column 1 of 
TABLE 2; it is given for those PL proteins that have been assigned an internal AVC ID. 

Many of the genes listed in TABLE 3 express more than one amino acid 
sequence, depending on alternative exon splicing and single amino acid changes. The Genbank 
reference presented is intended to represent one isoform of the gene listed. Alternatively 
spliced and point mutated isoforms of the genes listed in TABLE 3 are given separate Genbank 
references, and have been omitted for the purpose of brevity. In all such omitted cases, the 
carboxyl-terminal sequence is unaffected by these changes. In cases where the carboxyl-terminal 
sequence is affected by point mutations or alternative splicing, both isoforms have been 
included. In addition, those GPCRs for which no known receptor ligand has been identified 
(so-called "orphan GPCRs") have also been omitted. As indicated supra, all peptides are 
biotinylated at the amino terminus and the amino acid sequences correspond to the C-terminal 
sequence of the gene name listed to the left. 

TABLE 6 lists the sequences of the PDZ domains cloned into a vector (pGEX-3X 
vector) for production of GST-PDZ fusion proteins (Pharmacia). More specifically, the 
first column (left to right) entitled "PDZ Gene Name" lists the name of the gene containing the 
PDZ domain. The second column labeled "GI or Acc#" is a unique Genbank identifier for the 
gene used to design primers for PCR amplification of the listed sequence. The next column 
labeled "Domain#" indicates the Pfam-predicted PDZ domain number, as numbered from the 
amino-terminus of the gene to the carboxy-terminus. The last column entitled "Sequence fused 
to GST construct" provides the actual amino acid sequence inserted into the GST-PDZ 
expression vector as determined by DNA sequencing of the constructs. 

As discussed in detail herein, the PDZ proteins listed in TABLES 2 and 6 are 
naturally occurring proteins containing a PDZ domain. Only significant interactions are 
presented in TABLE 2. Thus, the present invention is particularly directed to the detection and 
modulation of interactions between a PDZ protein and PL protein pair listed in TABLE 2. As 
used herein the phrase "protein pair" or "protein binding pair" when used in reference to a PDZ 
protein and PL protein refers to a PL protein and PDZ protein listed in TABLE 2 which bind to 
one another. It should be understood that TABLE 2 is set up to show that certain PL proteins 
bind to a plurality of PDZ proteins. For example, PL protein AA329 binds to PDZ proteins 
KIAA0973 and KIAA0807. 
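
The one-to-many relationship just described, in which a single PL protein may appear in several protein binding pairs, can be represented by grouping table rows by PL identifier. The following Python sketch is a hedged illustration: the row format (a pair of PL ID and PDZ gene name) and the function name `group_pairs` are assumptions, and the only data used are the AA329 pairs named in the text above.

```python
from collections import defaultdict


def group_pairs(rows):
    """Group (pl_id, pdz_name) rows into a mapping of PL ID -> set of PDZ names.

    `rows` is assumed to hold one tuple per protein binding pair, i.e. one
    tuple per row of a table laid out like TABLE 2.
    """
    partners = defaultdict(set)
    for pl_id, pdz_name in rows:
        partners[pl_id].add(pdz_name)
    return dict(partners)


# The two pairs for AA329 that are stated in the text.
example_rows = [("AA329", "KIAA0973"), ("AA329", "KIAA0807")]
example_partners = group_pairs(example_rows)
```

Here `example_partners["AA329"]` contains both KIAA0973 and KIAA0807, reflecting that this PL protein binds a plurality of PDZ proteins.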

Interactions between GPCR proteins containing a PDZ ligand and PDZ proteins 
are not limited to those listed in TABLE 2. TABLE 4 presents a list of interactions between 
GPCR proteins and PDZ proteins. When numbered from left to right, the first column, labeled 
"GPCR gene," lists the GPCR protein that binds to a PDZ domain-containing protein. The 
second column labeled "PDZ-containing Gene" lists the specific PDZ-containing gene that 
binds to the GPCR gene listed to the left. The PDZ domain that binds to the GPCR is listed in 
column 3, labeled "PDZ domain(s)." These interactions were confirmed using assays other 
than the "G" or "A" assays described infra, and suggest that changes in PDZ-PL binding 
patterns may occur with the use of different assays or with the use of assay variations described 
infra. 

The interactions summarized in TABLE 2 can occur in a wide variety of cell 
types. Examples of such cells include hematopoietic, stem, neuronal, muscle, epidermal, 
epithelial, endothelial, and cells from essentially any tissue such as liver, lung, placenta, uterus, 
kidney, ovaries, testes, stomach, colon and intestine. Because the interactions disclosed herein 
can occur in such a wide variety of cell types, these interactions can also play an important role 
in a variety of biological functions. 

Thus, for example, in certain embodiments of the invention, the PL proteins of 
the invention bind a PDZ protein expressed in T lymphocytes, B lymphocytes, or both T and B 
lymphocytes. In an embodiment, the PL protein binds a PDZ protein expressed in endothelial 
cells. In various embodiments, the PL proteins and/or the PDZ protein to which it binds are not 
expressed in the nervous system (e.g., neurons). In still other instances the PL protein binds a 
PDZ protein that is expressed in neuronal cells. 

In various embodiments of the invention, the PL protein is expressed or 
up-regulated upon cell activation (e.g., in activated B lymphocytes, T lymphocytes) or upon entry 
into mitosis (e.g., up-regulation in rapidly proliferating cell populations). 

In certain other embodiments of the invention, the PL protein is (i) a 
protein that mediates immune cell (e.g., hematopoietic cell) activation or migration, (ii) a 
protein that does not mediate apoptosis in a cell type, (iii) a protein that is a G-protein coupled 
seven transmembrane helix receptor but not a serotonin receptor, (iv) a protein that is a G-protein 
coupled seven transmembrane helix receptor but not a cytokine receptor, or (v) a protein that is 
a G-protein coupled seven transmembrane helix receptor and is a cytokine receptor. 

IV. Classification of Interactions 

A. General 

The interactions summarized in TABLE 2 can occur in a wide variety of cell 
types. Examples of such cells include hematopoietic, stem, neuronal, muscle, epidermal, 
epithelial, endothelial, and cells from essentially any tissue such as liver, lung, placenta, uterus, 
kidney, ovaries, testes, stomach, colon and intestine. Because the interactions disclosed herein 
can occur in such a wide variety of cell types, these interactions can also play an important role 
in a variety of biological functions. Consequently, modulation of the interactions between PDZ 
proteins and PL proteins that are described herein can be utilized to regulate biological function 
in a wide range of cells. 

B. Exemplary PDZ Classification 

The PDZ proteins identified herein as interacting with particular PL proteins can 
be grouped into a number of different categories. Thus, as described in greater detail below, the 
methods and reagents that are provided herein can be utilized to modulate PDZ interactions, 
and thus biological functions, that are regulated by or otherwise involve the following classes of 
proteins. It should be recognized, however, that modulation of the interactions that are 
identified herein can be utilized to affect biological functions involving other protein classes. 

1. Protein Kinases 

A number of protein kinases contain PDZ domains. Protein kinases are widely 
involved in cellular metabolism and regulation of protein activity through phosphorylation of 
amino acids on proteins. An example of this is the regulation of signal transduction pathways 
such as T cell activation through the T cell Receptor, where ZAP-70 kinase function is required 
for transmission of the activation signal to downstream effector molecules. These molecules 
include, but are not limited to KIAA0303, KIAA0561, KIAA0807, KIAA0973, and CASK. 

2. Guanylate Kinases 

A number of guanylate kinases contain PDZ domains. These molecules include, 
but are not limited to Atrophin 1, CARD11, CARD14, DLG1, DLG2, DLG5, FLJ12615, 
MPP1, MPP2, NeDLG, p55T, PSD95, ZO-1, ZO-2, and ZO-3. 



3. Guanine Exchange Factors 

A number of guanine exchange factors contain PDZ domains. Guanine 
exchange factors regulate signal transduction pathways and other biological processes through 
facilitating the exchange of differently phosphorylated guanine nucleotides. These molecules 
include, but are not limited to GTPase, Guanine Exchange, KIAA0313, KIAA0380, 
KIAA0382, KIAA1389, KIAA1415, TIAM1, and TIAM2. 

4. LIM PDZs 

A number of LIM proteins contain PDZ domains. These molecules include, but 
are not limited to α-Actinin 2, ELFIN 1, ENIGMA, HEMBA1003117, KIAA0613, KIAA0858, 
KIAA0631, LIM Mystique, LIM protein, LIM-RIL, LIMK1, LIMK2, and LU-1. 

5. Proteins Containing Only PDZ Domains 

A number of proteins contain PDZ domains without any other predicted 
functional domains. These include, but are not limited to 26s subunit p27, AIPC, Cytohesin 
Binding Protein, EZRIN Binding Protein, FLJ00011, FLJ20075, FLJ21687, GRIP1, 
HEMBA1000505, KIAA0545, KIAA0967, KIAA1202, KIAA1284, KIAA1526, KIAA1620, 
KIAA1719, MAGI1, Novel PDZ gene, Outer Membrane, PAR3, PAR6, PAR6γ, PDZ-73, 
PDZK1, PICK1, PIST, prIL16, Shank1, SIP1, SITAC-18, Syntenin, Syntrophin γ2, TIP1, TIP2, 
and TIP43. 

6. Tyrosine Phosphatases 

A number of tyrosine phosphatases contain PDZ domains. Tyrosine 
phosphatases regulate biological processes such as signal transduction pathways through 
removal of phosphate groups required for function of the target protein. Examples of such 
enzymes include, but are not limited to PTN-3, PTN-4, and PTPL1. 

7. Serine Proteases 

A number of serine proteases contain PDZ domains. Proteases affect biological 
molecules by cleaving them to either activate or repress their functional ability. These enzymes 
have a variety of functions, including roles in digestion, blood coagulation and lysis of blood 
clots. These include, but are not limited to Novel Serine Protease and Serine Protease. 

8. Viral Oncogene Interacting Proteins that Contain PDZ Domains 

A number of TAX interacting proteins contain PDZ domains. Many of these 
also bind to multiple viral oncoproteins such as Adenovirus E4, Papillomavirus E6, and HBV 
protein X. These include, but are not limited to AIPC, Connector Enhancer, DLG1, DLG2, 
ERBIN, FLJ00011, FLJ11215, HEMBA1003117, INADL, KIAA0147, KIAA0807, 
KIAA1526, KIAA1634, LIMK1, LIM Mystique, LIM-RIL, MUPP1, NeDLG, Outer 
Membrane, PSD95, PTN-4, PTPL-1, Syntrophin γ1, Syntrophin γ2, TAX2-like protein, TIP2, 
TIP1, TIP33, and TIP43. 

9. Proteins Containing RA and/or RHA and/or DIL and/or IGFBP and/or 
WW and/or L27 and/or SAM and/or PH and/or DIX and/or DIP and/or Dishevelled and/or LRR 
and/or Hormone 3 and/or C2 and/or RPH3A and/or zf-TRAF and/or zf-C3HC4 and/or PID 
and/or NO_Synthase and/or Flavodoxin and/or FAD binding and/or NAD binding and/or Kazal 
and/or Trypsin and/or RBD and/or RGS and/or GoLoco and/or HR1 and/or BRO1 That Contain 
PDZ Domains 

A number of proteins containing RA and/or RHA and/or DIL and/or IGFBP 
and/or WW and/or L27 and/or SAM and/or PH and/or DIX and/or DIP and/or Dishevelled 
and/or LRR and/or Hormone 3 and/or C2 and/or RPH3A and/or zf-TRAF and/or zf-C3HC4 
and/or PID and/or NO_Synthase and/or Flavodoxin and/or FAD binding and/or NAD binding 
and/or Kazal and/or Trypsin and/or RBD and/or RGS and/or GoLoco and/or HR1 and/or BRO1 
contain PDZ domains. These include, but are not limited to AF6, APXL-1, MAGI1, DVL1, 
DVL2, DVL3, KIAA0417, KIAA0316, KIAA0340, KIAA0559, KIAA0751, KIAA0902, 
KIAA1095, KIAA1222, KIAA1634, MINT1, NOS1, RGS12, Rhophilin-like, Shank 3, 
Syntrophin 1α, Syntrophin β2, and X11β. 

C. Exemplary PL Classification 

The GPCR-PL proteins involved in the interactions listed in TABLE 2 are from 
a number of different classes. Consequently, the methods and reagents that are disclosed herein 
can be utilized to modulate interactions involving the following classes of GPCR-PL proteins, 
and thereby to modulate a biological function in cells; the classes are not intended to limit the 
scope of biological processes or diseases affected. The following classes also should not be 
considered exhaustive of the types of classes of GPCR proteins whose activity can be 
modulated using the methods and reagents that are provided herein. 

1. Serotonin Receptors 

Serotonin receptors are involved in a variety of physiological functions and 
disease states, including nociception, motor control, endocrine secretion, thermoregulation, 
appetite, control of exchanges between the central nervous system and cerebrospinal fluid, 
prostate cancer, hormone overproduction by endocrine tumors, migraine, irritable bowel 
syndrome, Alzheimer's disease, drug withdrawal, and a number of psychological disorders, 
including but not limited to depression, obsessive compulsive disorder, schizophrenia, and 
anxiety. Representative members of this family include, but are not limited to, 5-HT1A, 
5-HT1B, 5-HT1D, 5-HT1F, 5-HT2A, 5-HT2B, 5-HT2C, 5-HT4, 5-HT5A, 5-HT6, and 5-HT7. 
Modulation of serotonin receptor interactions with PDZ proteins may provide an effective 
means for treating a number of diseases, including but not limited to those listed above. 

2. Histamine Receptors 

Histamine receptors are involved in histamine responses, and affect 
several systems that result in asthma, allergy and inflammation responses. In addition, 
histamine receptors have been implicated in anaphylaxis, rhinoconjunctivitis, and 
gastroesophageal reflux disease (GERD). Representative GPCRs include, but are not 
limited to, histamine H1 receptor, histamine H2 receptor, histamine H3 receptor, and 
histamine H4 receptor. Modulation of histamine receptor interactions with PDZ proteins 
may provide effective treatments for these and many other diseases. 



3. Acetylcholine Receptors 

Acetylcholine receptors are involved in activation of neurons. 
Inappropriate activation can lead to Parkinson's-like symptoms in animal models, increased 
metabolic activity, increased cardiac activity, epilepsy, and psychological disorders and 
responses. Representative members include, but are not limited to, ACM1, ACM2, ACM3, 
ACM4 and ACM5. 

4. Adrenoceptors 
Adrenoceptors are involved in a number of biological processes, including 
synaptic plasticity, long term potentiation, inflammation, asthma, obesity, rheumatoid 
arthritis, overactive bladder disorder, and hypertension. In addition, these receptors have 
been implicated in heroin addiction, chronic heart failure, and other cardiovascular diseases. 
Representative members of this family include the beta1-, beta2-, beta3- and beta4- 
adrenergic receptors, and the alpha1- and alpha2- adrenergic receptors. Modulation of 
adrenergic receptor interactions with PDZ proteins may provide effective treatments for the 
diseases listed above and other cardiovascular diseases. 

5. Dopamine Receptors 
Dopamine receptors are known to be essential for normal
neurotransmission. Abnormalities in dopamine receptor function or localization can result in a
number of neurological diseases, including but not limited to Parkinson's disease, 
schizophrenia, and Attention Deficit Hyperactivity Disorder (ADHD). Representative members 
of this group include but are not limited to Dopamine 1 receptor, Dopamine 2 receptor, 
Dopamine 3 receptor, Dopamine 4 receptor, and Dopamine 5 receptor. Modulation of 
dopamine receptor interactions with PDZ proteins may provide effective treatments for a 
variety of neurological disorders, including those listed above. 

6. Bradykinin Receptors 
Bradykinin receptors are involved in a number of biological functions,
including inflammation, tissue injury, asthma, perennial rhinitis, diabetes, and brain edema.
Bradykinin receptors have also been implicated in various cardiovascular diseases, including
hypertension, myocardial hypertrophy, myocardial infarction, and arrhythmias. Representative
members of this group include but are not limited to B1 bradykinin receptor and B2 bradykinin
receptor. Modulation of bradykinin receptor interactions with PDZ proteins may provide
effective treatments for many diseases, including those listed above.

7. Anaphylatoxin Chemotactic Receptors 

Anaphylatoxin chemotactic receptors and their homologues form a 
group that is highly involved in the inflammatory response, and is involved in other 
biological functions to a lesser degree. Representative members of this group include C5a
anaphylatoxin chemotactic receptor and C3a anaphylatoxin chemotactic receptor. 
Modulation of anaphylatoxin chemotactic receptor interactions with PDZ proteins may 
provide potent therapies for inflammation. 

8. Interleukin 8 Receptors

Interleukin 8 receptors play a role in lung disease, multiple myeloma, 
and inflammation. Representative members of this group include IL8RA and IL8RB. 
Modulation of interleukin 8 receptor interactions with PDZ proteins may provide effective
treatments for these and other diseases. 


9. Fmet-leu-phe Receptors 

Fmet-leu-phe (FMLF or FMLP) receptors are receptors to 
chemoattractants, and thus are highly involved in inflammation, tissue injury and repair, and 
phagocytosis of foreign bacteria or microbes. Representative members of this group include
but are not limited to FMLP receptor I and FMLP receptor II. Modulation of FMLP receptor
interactions with PDZ proteins may provide effective means for regulating chemotaxis and 
inflammation, and for treating bacterial or viral infections. 

10. Angiotensin Receptors 

Angiotensin receptors are known to be involved in diabetes,
hypertension, cardiovascular disease, renal disease, proteinuria and other diseases.
Representative members of this group include but are not limited to type 2 angiotensin II
receptor, type 1A angiotensin II receptor, and type 1B angiotensin II receptor. Modulation
of angiotensin receptor interactions with PDZ proteins may provide effective treatments for
many diseases, including those listed above.



11. Endothelin Receptors 

Endothelin receptors play a role in a variety of biological functions, 
including a major role in the female reproductive cycle. In addition, these receptors have been
implicated in a number of diseases, including glaucoma, hypertension, congestive heart failure,
and cerebral vasospasm. Representative members of this group include but are not limited to
endothelin A receptor and endothelin B receptor. Modulation of endothelin receptor
interactions with PDZ proteins may provide effective treatments for a variety of diseases,
including those listed above.

12. Melanocortin Receptors 

Melanocortin receptors are known to be involved in obesity, anorexia 
nervosa, nociception, and a variety of other biological processes or disorders. Representative 
members of this group include but are not limited to adrenocorticotropic hormone receptor,
melanocortin receptor 2, melanocortin receptor 3, melanocortin receptor 4, melanocortin 
receptor 5, and melanocyte stimulating hormone receptor. Modulation of melanocortin 
receptor interactions with PDZ proteins may provide effective treatment for diseases such as 
obesity and anorexia nervosa. 


13. Neuropeptide Y Receptors 

Neuropeptide Y receptors are known to be involved in a number of 
biological functions and diseases, including stress, cardiovascular disease, feeding disorders, 
seizures, hypertension, obesity, anxiety, diabetes, and intestinal disorders. Representative 
members of this group include but are not limited to Neuropeptide Y receptor type 1,
Neuropeptide Y receptor type 2, Neuropeptide Y receptor type 4, and Neuropeptide Y receptor
type 5. Modulation of neuropeptide Y receptor interactions with PDZ proteins may provide
effective treatments for those diseases listed above and many others. 

14. Neurotensin Receptors

Neurotensin receptors are involved in a variety of diseases, including 
psychological disorders such as Parkinson's disease and schizophrenia. Representative 
members of this group include but are not limited to neurotensin receptor type 1 and
neurotensin receptor type 2. Modulation of neurotensin receptor interactions with PDZ proteins
may provide effective treatment for psychological disorders and other diseases. 

15. Opioid Receptors 

Opioid receptors are involved in a variety of diseases, including but not
limited to polycystic ovarian syndrome, irritable bowel syndrome, heroin addiction, and ileus.
Representative members of this group include mu-opioid receptor, delta-opioid receptor,
kappa-opioid receptor, and nociceptin receptor. Modulation of opioid receptor interactions with PDZ
proteins may provide effective treatments for these and many other diseases. 


16. Somatostatin Receptors 

Somatostatin receptors are involved in the modulation of endocrine
and exocrine functions in both nervous and non-nervous tissues, and play a role in obesity,
diabetes mellitus, acromegaly, and many other diseases. Representative members of this
family include sst1, sst2A, sst2B, sst3, sst4 and sst5. Modulation of somatostatin receptor
interactions with PDZ proteins may provide effective treatments for these diseases and 
various cancers, due to somatostatin receptor overexpression on many types of tumors. 

17. Tachykinin Receptors 

Tachykinin receptors are involved in a number of diseases and disorders, such as
incontinence, migraine, fibromyalgia, asthma, emesis, psoriasis, central nervous system
disorders, and gastrointestinal diseases. Representative members of this group include
Substance P receptor, Substance K receptor, Neuromedin K receptor 3, and Neuromedin K
receptor 4. Modulation of tachykinin receptor interactions with PDZ proteins may provide
effective treatment for those diseases listed above and many others.

18. Vasopressin-like Receptors 

Vasopressin-like receptors are involved in many biological functions, 
including reproductive regulation and water metabolism. Oxytocin receptor is highly 
involved in the reproductive system, regulating parturition, lactation, and other reproductive
functions. Representative members of this group include Vasopressin V1A, Vasopressin
V1B, Vasopressin V2, and Oxytocin receptor. Modulation of vasopressin-like receptor 
interactions with PDZ proteins may provide an effective means for regulating, among 
others, reproductive function and water metabolism.

19. Galanin-like Receptors

Galanin-like receptors are involved in a variety of diseases and disorders, 
including obesity, Alzheimer's disease, epilepsy, eating disorders, and depression. 
Representative members of this group include but are not limited to Galanin receptor type 1
and Galanin receptor type 2. Modulation of galanin-like receptor interactions with PDZ 
proteins may provide effective treatments for many diseases, including those listed above. 

20. Proteinase-activated like Receptors 

Proteinase-activated like receptors are involved in vascular and
cardiovascular disease, cancer, gastrointestinal disease and inflammation. Representative
members of this group include but are not limited to Proteinase-activated receptor 2,
Proteinase-activated receptor 3, and Thrombin receptor. Modulation of proteinase-activated
like receptor interactions with PDZ proteins may provide effective treatments for those diseases
listed above, in addition to many others.

21. Orexin & Neuropeptide FF Receptors 

Orexin and Neuropeptide FF receptors are involved in many diseases, 
such as eating disorders and narcolepsy. Representative members of this group include but are 
not limited to Neuropeptide FF receptor 1, Neuropeptide FF receptor 2, Orexin receptor 1, and
Orexin receptor 2. Modulation of orexin and neuropeptide FF receptor interactions with PDZ 
proteins may provide effective treatments for many diseases, including those listed above. 

22. Urotensin II Receptors 


23. Adrenomedullin Receptors 

24. Endothelin B-like Receptors

25. Chemokine Receptors

Chemokine receptors and their homologues form a group that is
involved in many biological processes, including but not limited to immunosurveillance, 
inflammation, viral infection, lung disease, graft/transplant rejection, HIV infection, 
autoimmune disease, angiogenesis, tumorigenesis, wound healing, and metastasis. 
Modulation of chemokine receptor interactions with PDZ proteins may provide effective 
treatments for these and other diseases. 

26. Neuromedin U Receptors 

27. Hormone Receptors 

Hormone receptors are involved in a number of endocrine functions 
and diseases, including but not limited to Graves' disease, autoimmune hypothyroidism, and 
thyroid cancer. Representative members of this group include follicle-stimulating hormone 
receptor, lutropin-choriogonadotropic hormone receptor, thyrotropin receptor, luteinizing 
hormone receptor, and gonadotropin receptor. Modulation of hormone receptor interactions 
with PDZ proteins may provide effective treatments for these and other endocrine diseases. 

28. Rhodopsin Receptors 

Rhodopsin receptors are highly involved in the visual system, regulating 
signal transduction in response to light stimuli. Representative members of this group include 
but are not limited to blue-sensitive opsin receptor, red-sensitive opsin receptor, green-sensitive 
opsin receptor, and rhodopsin. Modulation of rhodopsin receptor interactions with PDZ 
proteins may provide treatment for many diseases of the visual system. 

29. Olfactory Receptors 

Olfactory receptors are involved primarily in the sense of smell. 
Representative members of this group include OR1A1, OR1C1, OR2A4, OR2B2, OR2W1, and
OR2J3, in addition to many others. Modulation of olfactory receptor interactions with PDZ 
proteins may provide treatments for temporary loss of smell and permanent anosmia. 

30. Adenosine Receptors 

Adenosine receptors and their homologues form a group that is 
involved in renal disease, asthma, Parkinson's disease, and many other diseases.
Representative members of this group include, but are not limited to, Adenosine A1
receptor, Adenosine A2A receptor, Adenosine A2B receptor, and Adenosine A3 receptor. 
Modulation of adenosine receptor interactions with PDZ proteins may provide effective 
treatments for the diseases listed above, and others. 


31. Cannabis Receptors 

Cannabis receptors have been implicated in psychological disorders, 
hypotension, cardiovascular regulation, pain regulation, movement, memory, and appetite. In 
addition, they have been investigated as potential therapeutic targets for Huntington's disease,
Parkinson's disease, schizophrenia, and tremor. Representative members of this group include
but are not limited to Cannabinoid receptor 1 and Cannabinoid receptor 2. Modulation of 
cannabis receptor interactions with PDZ proteins may provide therapies such as those listed 
above. 

32. Platelet Activating Factor Receptors

33. Gonadotropin-releasing hormone Receptors 


34. Thyrotropin-releasing hormone & Secretagogue Receptors 
Thyrotropin-releasing hormone & secretagogue receptors are known to
be involved in many diseases, including thyroid diseases such as hypo- and hyperthyroidism,
amyotrophic lateral sclerosis (ALS), obesity, and gastrointestinal disorders such as
inflammatory bowel disease and ulcerative colitis. Representative members of this group
include but are not limited
to Growth Hormone Secretagogue receptor type 1, Motilin receptor, and thyrotropin-releasing 
hormone receptor. Modulation of thyrotropin-releasing hormone & secretagogue receptor 
interactions with PDZ proteins may provide relief for the 

35. Melatonin Receptors

Melatonin receptors are most commonly recognized for their role in the
circadian rhythm; however, these receptors also play a role in the cerebrovascular, reproductive,
visual, neuroendocrine, and neuroimmunological systems. In addition, they are associated with
cancer, rheumatoid arthritis, and reduction of NSAID-caused lesions. Representative members
of this group include but are not limited to melatonin receptor 1A, melatonin receptor 1B, and
melatonin-related receptor. Modulation of melatonin receptor interactions with PDZ proteins
may provide effective therapies and treatments for a variety of diseases, including those listed 
above. 


36. Lysosphingolipid & LPA (EDG) Receptors 



37. Leukotriene Receptors


38. Calcitonin Receptors 

Calcitonin receptors play a role in bone mineral density, osteoporosis, 
and prostate cancer. In addition, calcitonin receptors have been implicated in renal function, 
embryonic development, and sperm function and physiology. Modulation of calcitonin
receptor interactions with PDZ proteins may provide an effective means for treating diseases 
such as osteoporosis or prostate cancer. 

39. Corticotropin-releasing factor Receptors 

Corticotropin-releasing factor (CRF) receptors are known to be
involved in the stress response, irritable bowel syndrome, obesity, depression, eating
disorders, and cardiac and inflammatory diseases. Modulation of CRF receptor interactions 
with PDZ proteins may provide an effective means for treating stress and diseases 
associated with stress, including those listed above. 


40. Gastric Inhibitory Peptide Receptors 



41. Glucagon Receptors 
Glucagon peptide and Glucagon-like peptide receptors form a group
that is known to play a role in diabetes mellitus, obesity, and gastrointestinal repair and
cytoprotection. In addition, glucagon receptors are integral to glucagonoma syndrome,
which can be related to endocrine tumors. Modulation of glucagon receptor interactions
with PDZ proteins may provide an effective treatment for diabetes, obesity, glucagonoma,
and disorders characterized by injury and/or dysfunction of the intestinal mucosal
epithelium. 



42. Growth hormone-releasing hormone Receptors

43. Parathyroid hormone Receptors

44. PACAP Receptors

45. Secretin-like Receptors

Representative members of this group include but are not limited to
gastric inhibitory peptide receptor, growth hormone-releasing hormone receptor, parathyroid
hormone receptor, brain-specific angiogenesis inhibitor receptors, calcitonin receptors, CD97,
cadherin EGF LAG receptor, corticotropin releasing factor receptors, cell surface glycoprotein
EMR1, glucagon-like peptide receptors, Latrophilin-1 receptor, PACAP receptor, Lectomedin
receptors, and VIP receptors. Modulation of secretin-like receptor interactions with PDZ
proteins may provide effective treatment for a variety of diseases and disorders.



46. Vasoactive intestinal polypeptide Receptors

Vasoactive Intestinal Peptide (VIP) receptors play a role in a number
of autoimmune diseases, including but not limited to septic shock, rheumatoid arthritis,
multiple sclerosis, Crohn's disease, asthma, and autoimmune diabetes. In addition, VIP
receptors are known to be involved in the inflammatory response and pulmonary
hypertension, and are fundamental to Verner-Morrison syndrome. Modulation of vasoactive
intestinal peptide receptor interactions with PDZ proteins may provide an effective means
for treating autoimmune diseases, affecting inflammatory responses, or alleviating the
symptoms of Verner-Morrison Syndrome.



47. Diuretic hormone Receptors 



48. EMR1 Receptors

49. Latrophilin Receptors

50. Brain-Specific angiogenesis inhibitor (BAI) Receptors

51. Methuselah-like protein (MTH) Receptors 



52. Metabotropic glutamate receptors 

Metabotropic glutamate receptors are involved in inflammatory pain, 
anxiety, neurodegenerative diseases such as Parkinson's disease and Alzheimer's disease, 
brain ischemia, amyotrophic lateral sclerosis, and seizure disorders. Modulation of
metabotropic glutamate receptor interactions with PDZ proteins may provide effective 
anticonvulsant and neuroprotective therapies and treatments for inflammatory and other 
disorders. 

53. GABA Receptors

GABA receptors, or gamma-aminobutyric acid receptors, play a critical
role in the fine-tuning of central nervous system synaptic transmission and are attractive targets 
for the treatment of epilepsy, anxiety, depression, cognitive deficits, and nociceptive disorders. 
This family includes GBR1 and GBR2. 

54. Ocular Albinism protein Receptors



55. Frizzled/Smoothened Receptors 


56. Vomeronasal Receptors 



57. Thromboxane Receptors

Thromboxane receptors and their homologues form a group that is
involved in inflammation, asthma, and cardiovascular disorders such as myocardial
ischemia, hypertension, stroke, thrombosis, and restenosis. Modulation of thromboxane 
receptor interactions with PDZ proteins may provide effective treatments for many diseases, 
including but not limited to asthma, inflammation, and cardiovascular diseases. 

58. Prostaglandin Receptors 

Prostaglandin receptors are involved in arthritis, insomnia, colon 
cancer, and many other diseases. Prostaglandin receptors also play a large role in vascular 
contraction and thus are important effectors in, among others, inflammation, myocardial 
ischemia, hypertension, stroke, and thrombosis. Modulation of prostaglandin receptor 
interactions with PDZ proteins may provide effective treatments for vascular diseases, 
arthritis, colon cancer, insomnia, and other diseases. 

59. GPCR Receptors Expressed on T cell surface 

GPCRs are used for a number of functions on the surface of T cells,
including chemokine sensing, cytokine sensing, and environment sensing. Modulation of 
interactions between these receptors and PDZ proteins could be used to treat a wide variety of 
immune and inflammatory disorders. 

60. GPCR Receptors Expressed on B cell surface 

GPCRs are used for a number of functions on the surface of B cells,
including chemokine sensing, cytokine sensing, and environment sensing. Modulation of 
interactions between these receptors and PDZ proteins could be used to treat a wide variety of 
immune and inflammatory disorders. 

61. GPCR Receptors Expressed on NK cell surface 

62. GPCR Receptors Expressed on Monocyte surface 

63. GPCR Receptors Expressed on Granulocyte surface

64. GPCR Receptors Expressed on Endothelial Cell surface

65. GPCR Receptors Involved in the Immune Response 



66. GPCR Receptors Involved in the Cardiovascular System 



67. GPCR Receptors Involved in the Neural System 



68. GPCR Receptors Involved in the Inflammatory Response 

Many GPCRs are involved in inflammatory responses, whether vascular,
histamine related or other inflammatory responses. Modulation of PDZ:GPCRPL interactions
could be used to treat these symptoms. 

69. GPCR Receptors Involved in Asthma and Allergic Inflammation 

A number of GPCR proteins containing a PL motif are involved in
asthma and the allergic inflammatory response. These include, but are not limited to, adrenergic
receptors and leukotriene receptors. Modulation of PDZ:GPCRPL interactions could be used to 
treat these symptoms. 

70. GPCR Receptors Involved in Parkinson's Disease

Glutamate, GABA and NMDA receptors have been implicated as 
potential drug targets that may slow progression of Parkinson's disease or treat the symptoms 
such that quality of life improves. 

71. Group 71 

Members of Group 71 include alpha1A-adrenergic receptor, beta2-adrenergic
receptor, P2Y1 purinergic receptor, GRK6A, beta1-adrenergic receptor,
parathyroid hormone 1 receptor, parathyroid hormone 1 receptor, 5HT2B, platelet-derived
growth factor receptor, mGLUR1a, mGLUR5, SSTR2, SSTR2, IL8RB, CL1 (alpha-latrotoxin),
5HT2B, B1AR, rat SSTR2, 5HT2C, SSTR2A, CIRL1, CIRL2, CIRL1 & 2, prolactin-releasing
peptide receptor, kappa opioid receptor, mGLUR7.

V. Detection of PDZ Domain-Containing Proteins 
As noted supra, the present inventors have identified a number of PDZ protein
and PL protein interactions that can play a role in modulation of a number of biological
functions in a variety of cell types. A comprehensive list of PDZ domain-containing proteins
was retrieved from the Sanger Centre database (Pfam) by searching for the keyword "PDZ". The
corresponding cDNA sequences were retrieved from GenBank using the NCBI "entrez"
database (hereinafter, "GenBank PDZ protein cDNA sequences"). The DNA portion encoding
PDZ domains was identified by alignment of cDNA and protein sequence using CLUSTALW.
Based on the DNA/protein alignment information, primers encompassing the PDZ domains
were designed. The expression of certain PDZ-containing proteins in cells was detected by
polymerase chain reaction ("PCR") amplification of cDNAs obtained by reverse transcription
("RT") of cell-derived RNA (i.e., "RT-PCR"). PCR, RT-PCR and other methods for analysis
and manipulation of nucleic acids are well known and are described generally in Sambrook et
al., (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold Spring
Harbor Laboratory (hereinafter, "Sambrook"); and Ausubel et al., Current Protocols in
Molecular Biology, Greene Publishing and Wiley-Interscience, New York (1997), as
supplemented through January 1999 (hereinafter "Ausubel").

Samples of cDNA for those sequences identified through the foregoing search
were obtained and then amplified. In general, a sample of the cDNA (typically, 1/5 of a 20 µl
reaction) was used to conduct PCR. PCR was conducted using primers designed to amplify
specifically PDZ domain-containing regions of PDZ proteins of interest. Oligonucleotide
primers were designed to amplify one or more PDZ-encoding domains. The DNA sequences
encoding the various PDZ domains of interest were identified by inspection (i.e., conceptual
translation of the PDZ protein cDNA sequences obtained from GenBank, followed by
alignment with the PDZ domain amino acid sequence). TABLE 6 shows the PDZ-encoded
domains amplified, and the GenBank accession number of the PDZ-domain containing
proteins. To facilitate subsequent cloning of PDZ domains, the PCR primers included
endonuclease restriction sequences at their ends to allow ligation with pGEX-3X cloning vector
(Pharmacia, GenBank accession no. XXU13852) in frame with glutathione-S transferase (GST).
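
By way of illustration only, the primer design step described above (locating the cDNA region that encodes a PDZ domain and appending restriction sites to the primer ends) might be sketched as follows. The function names, the 18-nucleotide annealing length, and the two-base 5' clamp are assumptions made for this sketch and are not taken from the specification; only the BamHI (GGATCC) and EcoRI (GAATTC) recognition sequences are standard. The sketch does not add the extra bases that may be needed to keep the insert in frame with GST.

```python
# Illustrative sketch only: helper names, anneal length, and clamp are assumptions.
BAMHI_SITE = "GGATCC"  # standard BamHI recognition sequence
ECORI_SITE = "GAATTC"  # standard EcoRI recognition sequence


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq.upper()))


def domain_to_cdna_coords(cds_start, aa_first, aa_last):
    """Map a 1-based, inclusive residue range onto a 0-based, half-open cDNA slice.

    cds_start is the 0-based index of the first base of the start codon.
    """
    start = cds_start + 3 * (aa_first - 1)
    end = cds_start + 3 * aa_last
    return start, end


def design_primers(cdna, cds_start, aa_first, aa_last, anneal_len=18, clamp="CG"):
    """Build forward (BamHI) and reverse (EcoRI) primers flanking a domain."""
    start, end = domain_to_cdna_coords(cds_start, aa_first, aa_last)
    region = cdna[start:end].upper()
    if len(region) < anneal_len:
        raise ValueError("domain region is shorter than the annealing length")
    forward = clamp + BAMHI_SITE + region[:anneal_len]
    reverse = clamp + ECORI_SITE + reverse_complement(region[-anneal_len:])
    return forward, reverse
```

In practice the residue range would come from the cDNA/protein alignment, and primers would also be checked for melting temperature and for internal occurrences of the chosen restriction sites.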

TABLE 6 lists the proteins isolated for use in the aforementioned assays.

VI. Assays for Detection of Interactions Between PDZ-Domain Polypeptides and Candidate
PDZ Ligand proteins (PL proteins)

Two complementary assays, termed "A" and "G," were developed to detect
binding between a PDZ-domain polypeptide and candidate PDZ ligand. In each of the two
different assays, binding is detected between a peptide having a sequence corresponding to the
C-terminus of a protein anticipated to bind to one or more PDZ domains (i.e., a candidate PL
peptide) and a PDZ-domain polypeptide (typically a fusion protein containing a PDZ domain).
In the "A" assay, the candidate PL peptide is immobilized and binding of a soluble PDZ-domain
polypeptide to the immobilized peptide is detected (the "A" assay is named for the fact
that in one embodiment an avidin surface is used to immobilize the peptide). In the "G" assay,
the PDZ-domain polypeptide is immobilized and binding of a soluble PL peptide is detected
(the "G" assay is named for the fact that in one embodiment a GST-binding surface is used to
immobilize the PDZ-domain polypeptide). Preferred embodiments of these assays are
described in detail infra. However, it will be appreciated by ordinarily skilled practitioners that
these assays can be modified in numerous ways while remaining useful for the purposes of the
present invention.

A. Production of Fusion Proteins Containing PDZ-Domains 

GST-PDZ domain fusion proteins were prepared for use in the assays of the
invention. PCR products containing PDZ encoding domains (as described supra) were
subcloned into an expression vector to permit expression of fusion proteins containing a PDZ
domain and a heterologous domain (i.e., a glutathione-S transferase sequence, "GST"). PCR
products (i.e., DNA fragments) representing PDZ domain encoding DNA were extracted from
agarose gels using the "Sephaglas" gel extraction system (Pharmacia) according to the
manufacturer's recommendations.

As noted supra, PCR primers were designed to include endonuclease restriction
sites to facilitate ligation of PCR fragments into a GST gene fusion vector (pGEX-3X;
Pharmacia, GenBank accession no. XXU13852) in-frame with the glutathione-S transferase
coding sequence. This vector contains an IPTG-inducible lacZ promoter. The pGEX-3X vector
was linearized using BamHI and EcoRI or, in some cases, EcoRI or SmaI, and
dephosphorylated. For most cloning approaches, double digestion with BamHI and EcoRI
was performed, so that the ends of the PCR fragments to clone were BamHI and EcoRI. In
some cases, restriction endonuclease combinations used were BglII and EcoRI, BamHI and
MfeI, or EcoRI only, SmaI only, or BamHI only. When more than one PDZ domain was
cloned, the DNA portion cloned represents the PDZ domains and the cDNA portion located
between individual domains. Precise locations of cloned fragments used in the assays are
indicated in TABLE 6. DNA linker sequences between the GST portion and the PDZ domain
containing DNA portion vary slightly, depending on which of the above described cloning sites
and approaches were used. As a consequence, the amino acid sequence of the GST-PDZ fusion
protein varies in the linker region between GST and PDZ domain. Protein linker sequences
corresponding to different cloning sites/approaches are shown below. Linker sequences (vector
DNA encoded) are bold, PDZ domain containing gene derived sequences are in italics.

1) GST - BamHI/BamHI - PDZ domain insert
   Gly-Ile - PDZ domain insert

2) GST - BamHI/BglII - PDZ domain insert
   Gly-Ile - PDZ domain insert

3) GST - EcoRI/EcoRI - PDZ domain insert
   Gly-Ile-Pro-Gly-Asn - PDZ domain insert

4) GST - SmaI/SmaI - PDZ domain insert
   Gly-Ile-Pro - PDZ domain insert
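
As a non-limiting illustration, the four cloning approaches and their linker residues (Gly-Ile for BamHI/BamHI and BamHI/BglII, Gly-Ile-Pro-Gly-Asn for EcoRI/EcoRI, and Gly-Ile-Pro for SmaI/SmaI) can be captured in a small lookup table. The names `LINKERS` and `fusion_junction` and the one-letter output format are hypothetical and are used here only to show how the linker depends on the cloning sites.

```python
# Illustrative sketch only: table and function names are hypothetical.
# Keys are (vector site, insert site); values are the linker residues listed above.
LINKERS = {
    ("BamHI", "BamHI"): ["Gly", "Ile"],
    ("BamHI", "BglII"): ["Gly", "Ile"],
    ("EcoRI", "EcoRI"): ["Gly", "Ile", "Pro", "Gly", "Asn"],
    ("SmaI", "SmaI"): ["Gly", "Ile", "Pro"],
}

THREE_TO_ONE = {"Gly": "G", "Ile": "I", "Pro": "P", "Asn": "N"}


def fusion_junction(vector_site, insert_site, pdz_residues):
    """Describe the GST/linker/PDZ junction using one-letter residue codes."""
    linker = LINKERS[(vector_site, insert_site)]
    linker_one_letter = "".join(THREE_TO_ONE[res] for res in linker)
    return "GST-" + linker_one_letter + "-" + pdz_residues
```

For example, `fusion_junction("EcoRI", "EcoRI", "KV")` yields a junction label whose linker portion is the five vector-encoded residues, followed by the first residues of the insert.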


The PDZ-encoding PCR fragment and linearized pGEX-3X vector were ethanol
precipitated and resuspended in 10 µl standard ligation buffer. Ligation was performed for 4-10
hours at 7°C using T4 DNA ligase. It will be understood that some of the resulting constructs
include very short linker sequences and that, when multiple PDZ domains were cloned, the
constructs included some DNA located between individual PDZ domains.

The ligation products were transformed into DH5α or BL21 E. coli bacterial
strains. Colonies were screened for presence and identity of the cloned PDZ domain containing
DNA as well as for correct fusion with the glutathione S-transferase encoding DNA portion by
PCR and by sequence analysis. Positive clones were tested in a small scale assay for
expression of the GST/PDZ domain fusion protein and, if expressing, these clones were
subsequently grown up for large scale preparations of GST/PDZ fusion protein.

GST-PDZ domain fusion protein was overexpressed following addition of IPTG
to the culture medium and purified. Detailed procedures for small scale and large scale fusion
protein expression and purification are described in "GST Gene Fusion System" (second
edition, revision 2; published by Pharmacia). In brief, a small culture (3-5 ml) containing a
bacterial strain (DH5α, BL21 or JM109) with the fusion protein construct was grown overnight
in LB media at 37°C with the appropriate antibiotic selection (100 µg/ml ampicillin; a.k.a.
LB-amp). The overnight culture was poured into a fresh preparation of LB-amp (typically
250-500 ml) and grown until the optical density (OD) of the culture was between 0.5 and 0.9
(approximately 2.5 hours). IPTG (isopropyl β-D-thiogalactopyranoside) was added to a final
concentration of 1.0 mM to induce production of GST fusion protein, and the culture was grown an
additional 1.5-2.5 hours. Bacteria were collected by centrifugation (4500 g) and resuspended in
Buffer A- (50 mM Tris, pH 8.0, 50 mM dextrose, 1 mM EDTA, 200 µM
phenylmethylsulfonylfluoride). An equal volume of Buffer A+ (Buffer A-, 4 mg/ml lysozyme)
was added and incubated on ice for 3 min to lyse bacteria. An equal volume of Buffer B
(10 mM Tris, pH 8.0, 50 mM KCl, 1 mM EDTA, 0.5% Tween-20, 0.5% NP40 (a.k.a. IGEPAL
CA-630), 200 µM phenylmethylsulfonylfluoride) was added and incubated for an additional 20
min. The bacterial cell lysate was centrifuged (20,000 x g), and supernatant was added to
glutathione Sepharose 4B (Pharmacia, cat no. 17-0765-01) previously swelled (rehydrated) in
1X phosphate-buffered saline (PBS). The supernatant-Sepharose slurry was poured into a
column and washed with at least 20 bed volumes of 1X PBS. GST fusion protein was eluted
off the glutathione Sepharose by applying 0.5-1.0 ml aliquots of 5 mM glutathione and
collected as separate fractions. Concentrations of fractions were determined using BioRad
Protein Assay (cat no. 500-0006) according to manufacturer's specifications. Those fractions
containing the highest concentration of fusion protein were pooled and dialyzed against 1X
PBS/35% glycerol. Fusion proteins were assayed for size and quality by SDS polyacrylamide gel
electrophoresis (SDS-PAGE) as described in "Sambrook." Fusion protein aliquots were stored at
minus 80°C and at minus 20°C.
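
The reagent additions in the foregoing procedure reduce to simple dilution arithmetic, which might be sketched as follows. The 1 M (1000 mM) IPTG stock concentration used in the usage note is an assumption for illustration; the specification states only the final concentration of 1.0 mM and a culture volume of 250-500 ml. The function names are likewise hypothetical.

```python
# Illustrative sketch only: function names and the stock concentration are assumptions.
def stock_volume_ml(final_conc_mM, culture_volume_ml, stock_conc_mM):
    """Volume of stock to add so the mixture reaches final_conc_mM.

    Solves stock * v = final * (culture + v), i.e. the added volume is
    counted as part of the final volume.
    """
    if stock_conc_mM <= final_conc_mM:
        raise ValueError("stock must be more concentrated than the target")
    return final_conc_mM * culture_volume_ml / (stock_conc_mM - final_conc_mM)


def conc_after_equal_volume(conc):
    """Concentration of a component after mixing with an equal volume lacking it."""
    return conc / 2.0
```

Under these assumptions, inducing a 500 ml culture to 1.0 mM from a 1000 mM stock calls for roughly 0.5 ml of stock, and adding an equal volume of Buffer A+ (4 mg/ml lysozyme) to cells resuspended in Buffer A- gives about 2 mg/ml lysozyme in the lysis mixture.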

B. Identification of Candidate PL Proteins and Synthesis of Peptides 

Certain PDZ domains are bound by the C-terminal residues of PDZ-binding
proteins. To identify PL proteins, the C-terminal residues of sequences were visually inspected
for sequences that one might predict would bind to PDZ-domain containing proteins (see, e.g.,
Doyle et al., 1996, Cell 85, 1067; Songyang et al., 1997, Science 275, 73). TABLE 3 lists
these proteins, and provides corresponding C-terminal sequences and GenBank accession 
numbers. 
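
The visual inspection described above could, as one possibility, be approximated by a simple pattern search. The following sketch is illustrative only and is not the procedure used to compile TABLE 3. It assumes a class I type consensus of the kind discussed in the cited literature (S or T at position -2, any residue at position -1, and a hydrophobic C-terminal residue); the function name, the choice of V, I or L as the hydrophobic set, and the example sequences are hypothetical.

```python
import re

# Assumed consensus: S/T at position -2, any residue at -1, and a
# hydrophobic residue (here V, I or L) as the C-terminal residue.
CLASS_I_MOTIF = re.compile(r"[ST].[VIL]$")


def is_candidate_pl(protein_sequence: str) -> bool:
    """Return True if the last three residues fit the assumed consensus."""
    return bool(CLASS_I_MOTIF.search(protein_sequence.strip().upper()))


# Hypothetical C-terminal fragments (one-letter code), for illustration only.
fragments = {"fragment_1": "KQTSV", "fragment_2": "GAAKK"}
candidates = [name for name, seq in fragments.items() if is_candidate_pl(seq)]
```

A pattern of this kind can only nominate candidates; binding would still need to be confirmed experimentally, e.g., by the assays described herein.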

Synthetic peptides of defined sequence (e.g., corresponding to the carboxyl-termini
of the indicated proteins) can be synthesized by any standard resin-based method (see,
e.g., U.S. Pat. No. 4,108,846; see also, Caruthers et al., 1980, Nucleic Acids Res. Symp. Ser.,
215-223; Horn et al., 1980, Nucleic Acids Res. Symp. Ser., 225-232; Roberge et al., 1995,
Science 269:202). The peptides used in the assays described herein were prepared by the
FMOC method (see, e.g., Guy and Fields, 1997, Meth. Enz. 289:67-83; Wellings and Atherton, 1997,
Meth. Enz. 289:44-67). In some cases (e.g., for use in the A and G assays of the invention),
peptides were labeled with biotin at the amino-terminus by reaction with a four-fold excess of
biotin methyl ester in dimethylsulfoxide with a catalytic amount of base. The peptides were
cleaved from the resin using a halide-containing acid (e.g. trifluoroacetic acid) in the presence
of appropriate antioxidants (e.g. ethanedithiol), and excess solvent was lyophilized.

Following lyophilization, peptides can be redissolved and purified by reverse
phase high performance liquid chromatography (HPLC). One appropriate HPLC solvent
system involves a Vydac C-18 semi-preparative column running at 5 mL per minute with
increasing quantities of acetonitrile plus 0.1% trifluoroacetic acid in a base solvent of water
plus 0.1% trifluoroacetic acid. After HPLC purification, the identities of the peptides are
confirmed by MALDI cation-mode mass spectrometry. As noted, exemplary biotinylated
peptides are provided in TABLE 3.

C. Detecting PDZ-PL Interactions 

The present inventors were able in part to identify the interactions summarized
in TABLE 2 by developing new high throughput screening assays which are described in
greater detail infra. Various other assay formats known in the art can be used to select ligands
that are specifically reactive with a particular protein. For example, solid-phase ELISA
immunoassays, immunoprecipitation, Biacore, and Western blot assays can be used to identify
peptides that specifically bind PDZ-domain polypeptides. As discussed supra, two different,
complementary assays were developed to detect PDZ-PL interactions. In each, one binding
partner of a PDZ-PL pair is immobilized, and the ability of the second binding partner to bind
is determined. These assays, which are described infra, can be readily used to screen for
hundreds to thousands of potential PDZ-ligand interactions in a few hours. Thus these assays
can be used to identify yet more novel PDZ-PL interactions in hematopoietic cells. In addition,
they can be used to identify antagonists of PDZ-PL interactions (see infra).

In various embodiments, fusion proteins are used in the assays and devices of the
invention. Methods for constructing and expressing fusion proteins are well known. Fusion
proteins generally are described in Ausubel et al., supra, Kroll et al., 1993, DNA Cell. Biol.
12:441, and Imai et al., 1997, Cell 91:521-30. Usually, the fusion protein includes a domain to
facilitate immobilization of the protein to a solid substrate ("an immobilization domain").
Often, the immobilization domain includes an epitope tag (i.e., a sequence recognized by an
antibody, typically a monoclonal antibody) such as polyhistidine (Bush et al., 1991, J. Biol.
Chem. 266:13811-14), SEAP (Berger et al., 1988, Gene 66:1-10), or M1 and M2 flag (see, e.g.,
U.S. Pat. Nos. 5,011,912; 4,851,341; 4,703,004; 4,782,137). In an embodiment, the
immobilization domain is a GST coding region. It will be recognized that, in addition to the
PDZ-domain and the particular residues bound by an immobilized antibody, protein A, or
otherwise contacted with the surface, the protein (e.g., fusion protein) will contain additional
residues. In some embodiments these are residues naturally associated with the PDZ-domain
(i.e., in a particular PDZ-protein) but they may include residues of synthetic (e.g.,
poly(alanine)) or heterologous origin (e.g., spacers of, e.g., between 10 and 300 residues).

PDZ domain-containing polypeptides used in the methods of the invention (e.g.,
PDZ fusion proteins) are typically made by (1) constructing a vector (e.g.,
plasmid, phage or phagemid) comprising a polynucleotide sequence encoding the desired
polypeptide, (2) introducing the vector into a suitable expression system (e.g., a prokaryotic,
insect, mammalian, or cell free expression system), (3) expressing the fusion protein and (4)
optionally purifying the fusion protein.

(1) In one embodiment, expression of the protein comprises inserting the
coding sequence into an appropriate expression vector (i.e., a vector that contains the necessary
elements for the transcription and translation of the inserted coding sequence required for the
expression system employed, e.g., control elements including enhancers, promoters,
transcription terminators, origins of replication, a suitable initiation codon (e.g., methionine),
open reading frame, and translational regulatory signals (e.g., a ribosome binding site, a
termination codon and a polyadenylation sequence)). Depending on the vector system and host
utilized, any number of suitable transcription and translation elements, including constitutive
and inducible promoters, can be used.

The coding sequence of the fusion protein includes a PDZ domain and an
immobilization domain as described elsewhere herein. Polynucleotides encoding the amino
acid sequence for each domain can be obtained in a variety of ways known in the art; typically
the polynucleotides are obtained by PCR amplification of cloned plasmids, cDNA libraries, and
cDNA generated by reverse transcription of RNA, using primers designed based on sequences
determined by the practitioner or, more often, publicly available (e.g., through GenBank). The
primers include linker regions (e.g., sequences including restriction sites) to facilitate cloning
and manipulation in production of the fusion construct. The polynucleotides corresponding to
the PDZ and immobilization regions are joined in-frame to produce the fusion protein-encoding
sequence.
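
The requirement that the two regions be joined in-frame can be checked computationally before a construct is made. The following is a minimal sketch under stated assumptions, not a description of any particular cloning strategy used herein: it assumes the upstream segment begins at the first base of a codon, treats a linker carrying a restriction site as part of the coding sequence, and uses hypothetical sequences and a hypothetical function name.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_in_frame(upstream: str, linker: str, downstream: str) -> str:
    """Concatenate coding segments and verify the reading frame is preserved.

    Assumes `upstream` starts at the first base of a codon. Raises ValueError
    if the upstream segment or linker would shift the frame, or if a stop
    codon occurs before the final codon of the fusion.
    """
    upstream, linker, downstream = (s.upper() for s in (upstream, linker, downstream))
    if len(upstream) % 3 or len(linker) % 3:
        raise ValueError("upstream segment or linker would shift the reading frame")
    fusion = upstream + linker + downstream
    # Split into complete codons; a trailing partial codon is ignored.
    codons = [fusion[i:i + 3] for i in range(0, len(fusion) - 2, 3)]
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        raise ValueError("premature stop codon in the fusion coding sequence")
    return fusion


# Hypothetical fragments: a short upstream segment, a six-base linker
# containing a BamHI site (GGATCC), and a downstream segment ending in a stop.
fusion_cds = join_in_frame("ATGGCC", "GGATCC", "AAAGTGTAA")
```

A six-base linker such as the one shown adds two codons and leaves the frame unchanged; a five-base linker would be rejected by the check.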

The fusion proteins of the invention may be expressed as secreted proteins (e.g.,
by including the signal sequence encoding DNA in the fusion gene; see, e.g., Lui et al., 1993,
PNAS USA, 90:8957-61) or as nonsecreted proteins.

In some embodiments, the PDZ-containing proteins are immobilized on a solid
surface. The substrate to which the polypeptide is bound may be in any of a variety of forms, e.g.,
a microtiter dish, a test tube, a dipstick, a microcentrifuge tube, a bead, a spinnable disk, and
the like. Suitable materials include glass, plastic (e.g., polyethylene, PVC, polypropylene,
polystyrene, and the like), protein, paper, carbohydrate, lipid monolayer or supported lipid
bilayer, and other solid supports. Other materials that may be employed include ceramics,
metals, metalloids, semiconductive materials, cements and the like.

In some embodiments, the fusion proteins are organized as an array. The term
"array," as used herein, refers to an ordered arrangement of immobilized fusion proteins, in
which particular different fusion proteins (i.e., having different PDZ domains) are located at
different predetermined sites on the substrate. Because the location of particular fusion proteins
on the array is known, binding at that location can be correlated with binding to the PDZ
domain situated at that location. Immobilization of fusion proteins on beads (individually or in
groups) is another particularly useful approach. In one embodiment individual fusion proteins
are immobilized on beads. In one embodiment, mixtures of distinguishable beads are used.
Distinguishable beads are beads that can be separated from each other on the basis of a property
such as size, magnetic property, color (e.g., using FACS) or affinity tag (e.g., a bead coated
with protein A can be separated from a bead not coated with protein A by using IgG affinity
methods). Binding to a particular PDZ domain may be determined; similarly, the effect of test
compounds (i.e., agonists and antagonists of binding) may be determined.

Methods for immobilizing proteins are known, and include covalent and non-covalent
methods. One suitable immobilization method is antibody-mediated immobilization.
According to this method, an antibody specific for the sequence of an "immobilization domain"
of the PDZ-domain containing protein is itself immobilized on the substrate (e.g., by
adsorption). One advantage of this approach is that a single antibody may be adhered to the
substrate and used for immobilization of a number of polypeptides (sharing the same
immobilization domain). For example, an immobilization domain consisting of poly-histidine
(Bush et al., 1991, J. Biol. Chem. 266:13811-14) can be bound by an anti-histidine monoclonal
antibody (R&D Systems, Minneapolis, MN); an immobilization domain consisting of secreted
alkaline phosphatase ("SEAP") (Berger et al., 1988, Gene 66:1-10) can be bound by anti-SEAP
(Sigma Chemical Company, St. Louis, MO); an immobilization domain consisting of a FLAG
epitope can be bound by anti-FLAG. Other ligand-antiligand immobilization methods are also
suitable (e.g., an immobilization domain consisting of protein A sequences (Harlow and Lane,
1988, Antibodies A Laboratory Manual, Cold Spring Harbor Laboratory; Sigma Chemical Co.,
St. Louis, MO) can be bound by IgG; and an immobilization domain consisting of streptavidin
can be bound by biotin (Harlow & Lane, supra; Sigma Chemical Co., St. Louis, MO)). In a
preferred embodiment, the immobilization domain is a GST moiety, as described herein.

When antibody-mediated immobilization methods are used, glass and plastic are
especially useful substrates. The substrates may be printed with a hydrophobic (e.g., Teflon)
mask to form wells. Preprinted glass slides with 3, 10 and 21 wells per 14.5 cm² slide
"working area" are available (from, e.g., SPI Supplies, West Chester, PA; also see U.S. Pat. No.
4,011,350). In certain applications, a large format (12.4 cm x 8.3 cm) glass slide printed in a
96 well format is used; this format facilitates the use of automated liquid handling equipment
and utilization of 96 well format plate readers of various types (fluorescent, colorimetric,
scintillation). However, higher densities may be used (e.g., more than 10 or 100 polypeptides
per cm²). See, e.g., MacBeath et al., 2000, Science 289:1760-63.

Typically, antibodies are bound to substrates (e.g., glass substrates) by
adsorption. Suitable adsorption conditions are well known in the art and include incubation of
0.5-50 µg/ml (e.g., 10 µg/ml) mAb in buffer (e.g., PBS, or 50 to 300 mM Tris, MOPS, HEPES,
PIPES, acetate buffers, pH 6.5 to 8) at 4°C to 37°C for from 1 hr to more than 24 hours.

Proteins may be covalently bound or noncovalently attached through nonspecific
bonding. If covalent bonding between the fusion protein and the surface is desired, the
surface will usually be polyfunctional or be capable of being polyfunctionalized. Functional
groups which may be present on the surface and used for linking can include carboxylic acids,
aldehydes, amino groups, cyano groups, ethylenic groups, hydroxyl groups, mercapto groups
and the like. The manner of linking a wide variety of compounds to various surfaces is well
known and is amply illustrated in the literature.

"A Assay" Detection of PDZ-Ligand Binding Using Immobilized PL Peptide. 

In one aspect, the invention provides an assay in which biotinylated candidate
PL peptides are immobilized on an avidin coated surface. The binding of PDZ-domain fusion
protein to this surface is then measured. In a preferred embodiment, the PDZ-domain fusion
protein is a GST/PDZ fusion protein and the assay is carried out as follows:

(1) Avidin is bound to a surface, e.g. a protein binding surface. In one
embodiment, avidin is bound to a polystyrene 96 well plate (e.g., Nunc Polysorb (cat #475094))
by addition of 100 µL per well of 20 µg/mL of avidin (Pierce) in phosphate buffered saline
without calcium and magnesium, pH 7.4 ("PBS", GibcoBRL) at 4°C for 12 hours. The plate is
then treated to block nonspecific interactions by addition of 200 µL per well of PBS containing
2 g per 100 mL protease-free bovine serum albumin ("PBS/BSA") for 2 hours at 4°C. The plate
is then washed 3 times with PBS by repeatedly adding 200 µL per well of PBS to each well of
the plate and then dumping the contents of the plate into a waste container and tapping the
plate gently on a dry surface.

(2) Biotinylated PL peptides (or candidate PL peptides, e.g. see TABLE 3)
are immobilized on the surface of wells of the plate by addition of 50 µL per well of 0.4 µM
peptide in PBS/BSA for 30 minutes at 4°C. Usually, each different peptide is added to at least
eight different wells so that multiple measurements (e.g. duplicates and also measurements
using different GST/PDZ-domain fusion proteins and a GST alone negative control) can be
made, and also additional negative control wells are prepared in which no peptide is
immobilized. Following immobilization of the PL peptide on the surface, the plate is washed 3
times with PBS.

(3) GST/PDZ-domain fusion protein (prepared as described supra) is
allowed to react with the surface by addition of 50 µL per well of a solution containing 5
µg/mL GST/PDZ-domain fusion protein in PBS/BSA for 2 hours at 4°C. As a negative
control, GST alone (i.e. not a fusion protein) is added to specified wells, generally at least 2
wells (i.e. duplicate measurements) for each immobilized peptide. After the 2 hour reaction,
the plate is washed 3 times with PBS to remove unbound fusion protein.

(4) The binding of the GST/PDZ-domain fusion protein to the avidin-biotinylated
peptide surface can be detected using a variety of methods and detectors known in
the art. In one embodiment, 50 µL per well of an anti-GST antibody in PBS/BSA (e.g. 2.5
µg/mL of polyclonal goat-anti-GST antibody, Pierce) is added to the plate and allowed to react
for 20 minutes at 4°C. The plate is washed 3 times with PBS and a second, detectably labeled
antibody is added. In one embodiment, 50 µL per well of 2.5 µg/mL of horseradish peroxidase
(HRP)-conjugated polyclonal rabbit anti-goat immunoglobulin antibody is added to the plate
and allowed to react for 20 minutes at 4°C. The plate is washed 5 times with 50 mM Tris pH
8.0 containing 0.2% Tween 20, and developed by addition of 100 µL per well of HRP-substrate
solution (TMB, Dako) for 20 minutes at room temperature (RT). The reaction of the HRP and
its substrate is terminated by the addition of 100 µL per well of 1M sulfuric acid and the optical
density (O.D.) of each well of the plate is read at 450 nm.

(5) Specific binding of a PL peptide and a PDZ-domain polypeptide is
detected by comparing the signal from the well(s) in which the PL peptide and PDZ domain
polypeptide are combined with the background signal(s). The background signal is the signal
found in the negative controls. Typically a specific or selective reaction will be at least twice
background signal, more typically more than 5 times background, and most typically 10 or
more times the background signal. In addition, a statistically significant reaction will involve
multiple measurements of the reaction with the signal and the background differing by at least
two standard errors, more typically four standard errors, and most typically six or more
standard errors. Correspondingly, a statistical test (e.g. a T-test) comparing repeated
measurements of the signal with repeated measurements of the background will result in a
p-value < 0.05, more typically a p-value < 0.01, and most typically a p-value < 0.001 or less.

As noted, in an embodiment of the "A" assay, the signal from binding of a
GST/PDZ-domain fusion protein to an avidin surface not exposed to (i.e. not covered with) the
PL peptide is one suitable negative control (sometimes referred to as "B1"). The signal from
binding of GST polypeptide alone (i.e. not a fusion protein) to an avidin-coated surface that has
been exposed to (i.e. covered with) the PL peptide is a second suitable negative control
(sometimes referred to as "B2"). Because all measurements are done in multiples (i.e. at least
duplicate) the arithmetic mean (or, equivalently, average) of several measurements is used in
determining the binding, and the standard error of the mean is used in determining the probable
error in the measurement of the binding. The standard error of the mean of N measurements
equals the square root of the following: the sum of the squares of the difference between each
measurement and the mean, divided by the product of (N) and (N-1). Thus, in one
embodiment, specific binding of the PDZ protein to the plate-bound PL peptide is determined
by comparing the mean signal ("mean S") and standard error of the signal ("SE") for a
particular PL-PDZ combination with the mean B1 and/or mean B2.
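
By way of illustration only, the mean, the standard error of the mean as defined above, and the comparison with background can be computed as in the following sketch. The function names and the example optical density readings are hypothetical, and combining the standard errors of signal and background in quadrature is an assumption made for the sketch; the text above does not specify how the two standard errors are combined.

```python
import math


def mean(values):
    """Arithmetic mean of replicate measurements."""
    return sum(values) / len(values)


def standard_error(values):
    """Standard error of the mean: sqrt(sum((x - mean)^2) / (N * (N - 1)))."""
    n = len(values)
    if n < 2:
        raise ValueError("at least duplicate measurements are required")
    m = mean(values)
    return math.sqrt(sum((x - m) ** 2 for x in values) / (n * (n - 1)))


def compare_to_background(signal, background):
    """Summarize replicate signal wells against replicate negative-control wells."""
    mean_s, mean_b = mean(signal), mean(background)
    # Assumption: standard errors of signal and background combined in quadrature.
    combined_se = math.hypot(standard_error(signal), standard_error(background))
    return {
        "mean_S": mean_s,
        "mean_B": mean_b,
        "fold_over_background": mean_s / mean_b if mean_b else math.inf,
        "difference_in_SE": (mean_s - mean_b) / combined_se if combined_se else math.inf,
    }


# Hypothetical O.D. readings at 450 nm from duplicate wells, for illustration only.
result = compare_to_background(signal=[1.20, 1.30], background=[0.10, 0.12])
# Least stringent thresholds named above: twice background and two standard errors.
is_specific = result["fold_over_background"] >= 2 and result["difference_in_SE"] >= 2
```

The same summary can be computed separately against each negative control (B1 and B2) when both are available.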

" G Assay" - Detection of PDZ-Ligand Binding Using Immobilized PDZ- 
Domain Fusion Polypeptide 

In one aspect, the invention provides an assay in which a GST/PDZ fusion
protein is immobilized on a surface ("G" assay). The binding of labeled PL peptide (e.g., as
listed in TABLE 2) to this surface is then measured. In a preferred embodiment, the assay is
carried out as follows:

(1) A PDZ-domain polypeptide is bound to a surface, e.g. a protein binding
surface. In a preferred embodiment, a GST/PDZ fusion protein containing one or more PDZ
domains is bound to a polystyrene 96-well plate. The GST/PDZ fusion protein can be bound to
the plate by any of a variety of standard methods known to one of skill in the art, although
some care must be taken that the process of binding the fusion protein to the plate does not alter
the ligand-binding properties of the PDZ domain. In one embodiment, the GST/PDZ fusion
protein is bound via an anti-GST antibody that is coated onto the 96-well plate. Adequate
binding to the plate can be achieved when:

a. 100 µL per well of 5 µg/mL goat anti-GST polyclonal antibody
(Pierce) in PBS is added to a polystyrene 96-well plate (e.g., Nunc Polysorb) at 4°C for 12
hours.

b. The plate is blocked by addition of 200 µL per well of PBS/BSA
for 2 hours at 4°C.

c. The plate is washed 3 times with PBS.

d. 50 µL per well of 5 µg/mL GST/PDZ fusion protein or, as a
negative control, GST polypeptide alone (i.e. not a fusion protein) in PBS/BSA is added to the
plate for 2 hours at 4°C.

e. The plate is again washed 3 times with PBS.

(2) Biotinylated PL peptides are allowed to react with the surface by
addition of 50 µL per well of 20 µM solution of the biotinylated peptide in PBS/BSA for 10
minutes at 4°C, followed by an additional 20 minute incubation at 25°C. The plate is washed 3
times with ice cold PBS.

(3) The binding of the biotinylated peptide to the GST/PDZ fusion protein
surface can be detected using a variety of methods and detectors known to one of skill in the
art. In one embodiment, 100 µL per well of 0.5 ng/mL streptavidin-horse radish peroxidase
(HRP) conjugate dissolved in BSA/PBS is added and allowed to react for 20 minutes at 4°C.
The plate is then washed 5 times with 50 mM Tris pH 8.0 containing 0.2% Tween 20, and
developed by addition of 100 µL per well of HRP-substrate solution (TMB, Dako) for 20
minutes at room temperature (RT). The reaction of the HRP and its substrate is terminated by
addition of 100 µL per well of 1 M sulfuric acid, and the optical density (O.D.) of each well of
the plate is read at 450 nm.

(4) Specific binding of a PL peptide and a PDZ domain polypeptide is
determined by comparing the signal from the well(s) in which the PL peptide and PDZ domain
polypeptide are combined with the background signal(s). The background signal is the signal
found in the negative control(s). Typically a specific or selective reaction will be at least twice
background signal, more typically more than 5 times background, and most typically 10 or
more times the background signal. In addition, a statistically significant reaction will involve
multiple measurements of the reaction with the signal and the background differing by at least
two standard errors, more typically four standard errors, and most typically six or more
standard errors. Correspondingly, a statistical test (e.g. a T-test) comparing repeated
measurements of the signal with repeated measurements of the background will result in a
p-value < 0.05, more typically a p-value < 0.01, and most typically a p-value < 0.001 or less. As
noted, in an embodiment of the "G" assay, the signal from binding of a given PL peptide to
immobilized (surface bound) GST polypeptide alone is one suitable negative control
(sometimes referred to as "B1"). Because all measurements are done in multiples (i.e. at least
duplicate) the arithmetic mean (or, equivalently, average) of several measurements is used in
determining the binding, and the standard error of the mean is used in determining the probable
error in the measurement of the binding. The standard error of the mean of N measurements
equals the square root of the following: the sum of the squares of the difference between each
measurement and the mean, divided by the product of (N) and (N-1). Thus, in one
embodiment, specific binding of the PDZ protein to the plate-bound peptide is determined by
comparing the mean signal ("mean S") and standard error of the signal ("SE") for a particular
PL-PDZ combination with the mean B1.

"G' assay" and "G" assay"

Two specific modifications of the specific conditions described supra for the "G
assay" are particularly useful. The modified assays use lesser quantities of labeled PL peptide
and have slightly different biochemical requirements for detection of PDZ-ligand binding
compared to the specific assay conditions described supra.

For convenience, the assay conditions described in this section are referred to as
the "G' assay" and the "G" assay," with the specific conditions described in the preceding
section on G assays being referred to as the "G° assay." The "G' assay" is identical to the "G°
assay" except at step (2) the peptide concentration is 10 µM instead of 20 µM. This results in
slightly lower sensitivity for detection of interactions with low affinity and/or rapid dissociation
rate. Correspondingly, it slightly increases the certainty that detected interactions are of
sufficient affinity and half-life to be of biological importance and useful therapeutic targets.

The "G" assay" is identical to the "G° assay" except that at step (2) the peptide
concentration is 1 µM instead of 20 µM and the incubation is performed for 60 minutes at 25°C
(rather than, e.g., 10 minutes at 4°C followed by 20 minutes at 25°C). This results in lower
sensitivity for interactions of low affinity, rapid dissociation rate, and/or affinity that is less at
25°C than at 4°C. Interactions will have lower affinity at 25°C than at 4°C if (as we have found
to be generally true for PDZ-ligand binding) the reaction entropy is negative (i.e. the entropy of
the products is less than the entropy of the reactants). In contrast, the PDZ-PL binding signal
may be similar in the "G" assay" and the "G° assay" for interactions of slow association and
dissociation rate, as the PDZ-PL complex will accumulate during the longer incubation of the
"G" assay." Thus comparison of results of the "G" assay" and the "G° assay" can be used to
estimate the relative entropies, enthalpies, and kinetics of different PDZ-PL interactions.
(Entropies and enthalpies are related to binding affinity by the equations delta G = RT ln (Kd) =
delta H - T delta S where delta G, H, and S are the reaction free energy, enthalpy, and entropy
respectively, T is the temperature in degrees Kelvin, R is the gas constant, and Kd is the
equilibrium dissociation constant). In particular, interactions that are detected only or much
more strongly in the "G° assay" generally have a rapid dissociation rate at 25°C (t1/2 < 10
minutes) and a negative reaction entropy, while interactions that are detected similarly strongly
in the "G" assay" generally have a slower dissociation rate at 25°C (t1/2 > 10 minutes). Rough
estimation of the thermodynamics and kinetics of PDZ-PL interactions (as can be achieved via
comparison of results of the "G° assay" versus the "G" assay" as outlined supra) can be used in
the design of efficient inhibitors of the interactions. For example, a small molecule inhibitor
based on the chemical structure of a PL that dissociates slowly from a given PDZ domain (as
evidenced by similar binding in the "G" assay" as in the "G° assay") may itself dissociate
slowly and thus be of high affinity.

In this manner, variation of the temperature and duration of step (2) of the "G
assay" can be used to provide insight into the kinetics and thermodynamics of the PDZ-ligand
binding reaction and into design of inhibitors of the reaction.
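
The relationships delta G = RT ln (Kd) = delta H - T delta S can be evaluated numerically, as in the following sketch. It is illustrative only: the enthalpy and entropy values are hypothetical and were not measured for any PDZ-PL pair, delta H and delta S are assumed to be constant between 4°C and 25°C, and the half-life function treats dissociation as simple first-order decay, which is an assumption not stated in the text.

```python
import math

R = 8.314  # gas constant, J/(mol*K)


def kelvin(temp_c):
    """Convert degrees Celsius to Kelvin."""
    return temp_c + 273.15


def delta_g_from_kd(kd_molar, temp_c):
    """Binding free energy in J/mol from delta G = RT ln(Kd)."""
    return R * kelvin(temp_c) * math.log(kd_molar)


def kd_from_enthalpy_entropy(delta_h, delta_s, temp_c):
    """Kd in mol/L from delta G = delta H - T delta S = RT ln(Kd)."""
    t = kelvin(temp_c)
    return math.exp((delta_h - t * delta_s) / (R * t))


def fraction_bound_after(half_life_min, elapsed_min):
    """Fraction of complex remaining, assuming first-order dissociation."""
    return 0.5 ** (elapsed_min / half_life_min)


# Hypothetical thermodynamic parameters, for illustration only.
DELTA_H = -60_000.0  # J/mol
DELTA_S = -100.0     # J/(mol*K)

kd_4c = kd_from_enthalpy_entropy(DELTA_H, DELTA_S, 4.0)
kd_25c = kd_from_enthalpy_entropy(DELTA_H, DELTA_S, 25.0)
weaker_at_25c = kd_25c > kd_4c  # larger Kd means lower affinity
```

With these assumed parameters the computed Kd is larger at 25°C than at 4°C, i.e., the modeled interaction is weaker at the higher temperature.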

Assay Variations

As discussed supra, it will be appreciated that many of the steps in the above-described
assays can be varied, for example, various substrates can be used for binding the PL
and PDZ-containing proteins; different types of PDZ containing fusion proteins can be used;
different labels for detecting PDZ/PL interactions can be employed; and different ways of
detection can be used.

The PL protein used in the assay is not intended to be limited to a 20 amino acid 
peptide. Full length or partial protein may be used, either alone or as a fusion protein. For 
example, a GST-PL protein fusion may be bound to the anti-GST antibody, with PDZ protein 
added to the bound PL protein or peptide. 

The PDZ-PL detection assays can employ a variety of surfaces to bind the PL
and PDZ-containing proteins. For example, a surface can be an "assay plate" which is formed
from a material (e.g. polystyrene) which optimizes adherence of either the PL protein or
PDZ-containing protein thereto. Generally, the individual wells of the assay plate will have a high
surface area to volume ratio and therefore a suitable shape is a flat bottom well (where the
proteins of the assays are adherent). Other surfaces include, but are not limited to, polystyrene
or glass beads, polystyrene or glass slides, and the like.

For example, the assay plate can be a "microtiter" plate. The term "microtiter"
plate when used herein refers to a multiwell assay plate, e.g., having between about 30 to 200
individual wells, usually 96 wells. Alternatively, high density arrays can be used. Often, the
individual wells of the microtiter plate will hold a maximum volume of about 250 µl.
Conveniently, the assay plate is a 96 well polystyrene plate (such as that sold by Becton
Dickinson Labware, Lincoln Park, N.J.), which allows for automation and high throughput
screening. Other surfaces include polystyrene microtiter ELISA plates such as that sold by
Nunc Maxisorp, Inter Med, Denmark. Often, about 50 µl to 300 µl, more preferably 100 µl to
200 µl, of an aqueous sample comprising buffers suspended therein will be added to each well
of the assay plate.

The detectable labels of the invention can be any detectable compound or
composition which is conjugated directly or indirectly with a molecule (such as described
above). The label can be detectable by itself (e.g., radioisotope labels or fluorescent labels) or,
in the case of an enzymatic label, can catalyze a chemical alteration of a substrate compound or
composition which is detectable. The preferred label is an enzymatic one which catalyzes a
color change of a non-radioactive color reagent.

Sometimes, the label is indirectly conjugated with the antibody. One of skill is
aware of various techniques for indirect conjugation. For example, the antibody can be
conjugated with biotin and any of the categories of labels mentioned above can be conjugated
with avidin, or vice versa (see also "A" and "G" assay above). Biotin binds selectively to
avidin and thus, the label can be conjugated with the antibody in this indirect manner. See,
Ausubel, supra, for a review of techniques involving biotin-avidin conjugation and similar
assays. Alternatively, to achieve indirect conjugation of the label with the antibody, the
antibody is conjugated with a small hapten (e.g. digoxin) and one of the different types of labels
mentioned above is conjugated with an anti-hapten antibody (e.g. anti-digoxin antibody). Thus,
indirect conjugation of the label with the antibody can be achieved.

Assay variations can include different washing steps. By "washing" is meant
exposing the solid phase to an aqueous solution (usually a buffer or cell culture media) in such
a way that unbound material (e.g., non-adhering cells, non-adhering capture agent, unbound
ligand, receptor, receptor construct, cell lysate, or HRP antibody) is removed therefrom. To
reduce background noise, it is convenient to include a detergent (e.g., Triton X) in the washing
solution. Usually, the aqueous washing solution is decanted from the wells of the assay plate
following washing. Conveniently, washing can be achieved using an automated washing
device. Sometimes, several washing steps (e.g., between about 1 to 10 washing steps) can be
required.

Various buffers can also be used in PDZ-PL detection assays. For example,
various blocking buffers can be used to reduce assay background. The term "blocking buffer"
refers to an aqueous, pH buffered solution containing at least one blocking compound which is
able to bind to exposed surfaces of the substrate which are not coated with a PL or
PDZ-containing protein. The blocking compound is normally a protein such as bovine serum
albumin (BSA), gelatin, casein or milk powder and does not cross-react with any of the
reagents in the assay. The block buffer is generally provided at a pH between about 7 to 7.5
and suitable buffering agents include phosphate and TRIS.

Various enzyme-substrate combinations can also be utilized in detecting PDZ-PL
interactions. Examples of enzyme-substrate combinations include, for example:

(i) Horseradish peroxidase (HRPO) with hydrogen peroxide as a substrate,
wherein the hydrogen peroxide oxidizes a dye precursor (e.g. orthophenylene diamine [OPD]
or 3,3',5,5'-tetramethyl benzidine hydrochloride [TMB]) (as described above).

(ii) alkaline phosphatase (AP) with para-Nitrophenyl phosphate as chromogenic
substrate.

(iii) β-D-galactosidase (β-D-Gal) with a chromogenic substrate (e.g.
p-nitrophenyl-β-D-galactoside) or fluorogenic substrate 4-methylumbelliferyl-β-D-galactoside.

Numerous other enzyme-substrate combinations are available to those skilled in
the art. For a general review of these, see U.S. Pat. Nos. 4,275,149 and 4,318,980, both of
which are herein incorporated by reference.

Further, it will be appreciated that, although, for convenience, the present
discussion primarily refers to antagonists of PDZ-PL interactions, agonists of PDZ-PL
interactions can be identified using the methods disclosed herein or readily apparent variations
thereof.

VII. Results of PDZ-PL Interaction Assays 

TABLE 2, supra, shows the results of assays in which specific binding was 
detected using the "G" assay described herein.

VIII. Measurement of PDZ-Ligand Binding Affinity

The "A" and "G" assays of the invention can be used to determine the "apparent 
affinity" of binding of a PDZ ligand peptide to a PDZ-domain polypeptide. Apparent affinity is 
determined based on the concentration of one molecule required to saturate the binding of a 
second molecule (e.g., the binding of a ligand to a receptor). Two particularly useful
approaches for quantitation of apparent affinity of PDZ-ligand binding are provided infra.

Approach 1:

(1) A GST/PDZ fusion protein, as well as GST alone as a negative control, are
bound to a surface (e.g., a 96-well plate) and the surface blocked and washed as described supra 
for the "G" assay. 

(2) 50 µL per well of a solution of biotinylated PL peptide (e.g., as shown in
TABLE 3) is added to the surface in increasing concentrations in PBS/BSA (e.g., at 0.1 µM,
0.33 µM, 1 µM, 3.3 µM, 10 µM, 33 µM, and 100 µM). In one embodiment, the PL peptide is
allowed to react with the bound GST/PDZ fusion protein (as well as the GST alone negative
control) for 10 minutes at 4°C followed by 20 minutes at 25°C. The plate is washed 3 times
with ice cold PBS to remove unbound labeled peptide.

(3) The binding of the PL peptide to the immobilized PDZ-domain polypeptide is
detected as described supra for the "G" assay.

(4) For each concentration of peptide, the net binding signal is determined by
subtracting the binding of the peptide to GST alone from the binding of the peptide to the
GST/PDZ fusion protein. The net binding signal is then plotted as a function of ligand
concentration and the plot is fit (e.g., by using the Kaleidagraph software package curve fitting
algorithm) to the following equation, where "Signal[ligand]" is the net binding signal at PL peptide
concentration "[ligand]," "Kd" is the apparent affinity of the binding event, and "Saturation
Binding" is a constant determined by the curve fitting algorithm to optimize the fit to the
experimental data:

$$\mathrm{Signal}_{[\mathrm{ligand}]} = \text{Saturation Binding} \times \frac{[\mathrm{ligand}]}{[\mathrm{ligand}] + K_d}$$

For reliable application of the above equation it is necessary that the highest
peptide ligand concentration successfully tested experimentally be greater than, or at least
similar to, the calculated Kd (equivalently, the maximum observed binding should be similar to
the calculated saturation binding). In cases where satisfying the above criteria proves difficult,
an alternative approach (infra) can be used.
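
As a hedged illustration of step (4) and of the reliability condition above, the following Python sketch subtracts the GST-alone background, fits Kd and Saturation Binding by least squares, and checks whether the highest tested concentration reaches the fitted Kd. It is not the Kaleidagraph algorithm named in the text. The function names, the Kd search limits, the `slack` parameter, and the synthetic data are all assumptions introduced only for this example.

```python
import math


def net_signal(pdz_signal, gst_signal):
    """Subtract GST-alone binding from GST/PDZ binding at each concentration."""
    return [p - g for p, g in zip(pdz_signal, gst_signal)]


def best_saturation(conc, signal, kd):
    """For a fixed Kd, the least-squares Saturation Binding has a closed form."""
    frac = [c / (c + kd) for c in conc]
    return sum(f * s for f, s in zip(frac, signal)) / sum(f * f for f in frac)


def squared_error(conc, signal, kd):
    """Sum of squared residuals at this Kd, using the best Saturation Binding."""
    sat = best_saturation(conc, signal, kd)
    return sum((s - sat * c / (c + kd)) ** 2 for c, s in zip(conc, signal))


def fit_binding(conc, signal, kd_min=1e-3, kd_max=1e4, rounds=4, points=101):
    """Fit Signal = Saturation * [L] / ([L] + Kd) by a refined grid search on log10(Kd).

    kd_min and kd_max are assumed search limits in the same units as conc.
    """
    lo, hi = math.log10(kd_min), math.log10(kd_max)
    best = lo
    for _ in range(rounds):
        step = (hi - lo) / (points - 1)
        grid = [lo + i * step for i in range(points)]
        best = min(grid, key=lambda x: squared_error(conc, signal, 10 ** x))
        lo, hi = best - step, best + step  # zoom in around the best grid point
    kd = 10 ** best
    return kd, best_saturation(conc, signal, kd)


def highest_conc_covers_kd(conc, kd, slack=1.0):
    """True if the highest tested concentration is at least kd / slack.

    slack > 1 is a hypothetical way to encode "at least similar to" Kd.
    """
    return max(conc) * slack >= kd


# Synthetic, noise-free example (concentrations in µM, as in step (2)).
conc = [0.1, 0.33, 1, 3.3, 10, 33, 100]
gst = [0.05] * len(conc)
pdz = [0.05 + 2.0 * c / (c + 5.0) for c in conc]
kd, saturation = fit_binding(conc, net_signal(pdz, gst))
print(round(kd, 3), round(saturation, 3), highest_conc_covers_kd(conc, kd))
```

With real, noisy data a dedicated nonlinear least-squares routine would normally be preferred; the grid search is used here only to keep the sketch free of external dependencies.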

Approach 2: 

(1) A fixed concentration of a PDZ-domain polypeptide and increasing
concentrations of a labeled PL peptide (labeled with, for example, biotin or fluorescein, see
TABLE 2 for representative peptide amino acid sequences) are mixed together in solution and
allowed to react. In one embodiment, preferred peptide concentrations are 0.1 µM, 1 µM, 10
µM, 100 µM, and 1 mM. In various embodiments, appropriate reaction times can range from 10
minutes to 2 days at temperatures ranging from 4°C to 37°C. In some embodiments, the
identical reaction can also be carried out using a non-PDZ domain-containing protein as a
control (e.g., if the PDZ-domain polypeptide is a fusion protein, the fusion partner can be used).

(2) PDZ-ligand complexes can be separated from unbound labeled peptide
using a variety of methods known in the art. For example, the complexes can be separated using
high performance size-exclusion chromatography (HPSEC, gel filtration) (Rabinowitz et al.,
1998, Immunity 9:699), affinity chromatography (e.g., using glutathione Sepharose beads), or
affinity absorption (e.g., by binding to an anti-GST-coated plate as described supra).

(3) The PDZ-ligand complex is detected based on presence of the label on
the peptide ligand using a variety of methods and detectors known to one of skill in the art. For
example, if the label is fluorescein and the separation is achieved using HPSEC, an in-line
fluorescence detector can be used. The binding can also be detected as described supra for the
"G" assay.

(4) The PDZ-ligand binding signal is plotted as a function of ligand
concentration and the plot is fit (e.g., by using the Kaleidagraph software package curve
fitting algorithm) to the following equation, where "Signal[ligand]" is the binding signal at PL
peptide concentration "[ligand]," "Kd" is the apparent affinity of the binding
event, and "Saturation Binding" is a constant determined by the curve fitting algorithm to
optimize the fit to the experimental data:

$$\mathrm{Signal}_{[\mathrm{ligand}]} = \text{Saturation Binding} \times \frac{[\mathrm{ligand}]}{[\mathrm{ligand}] + K_d}$$

Measurement of the affinity of a labeled peptide ligand binding to a PDZ-domain
polypeptide is useful because knowledge of the affinity (or apparent affinity) of this
interaction allows rational design of inhibitors of the interaction with known potency. The
potency of inhibitors in inhibition would be similar to (i.e., within one order of magnitude of)
the apparent affinity of the labeled peptide ligand binding to the PDZ-domain.

Thus, in one aspect, the invention provides a method of determining the
apparent affinity of binding between a PDZ domain and a ligand by immobilizing a polypeptide
comprising the PDZ domain and a non-PDZ domain on a surface, contacting the immobilized
polypeptide with a plurality of different concentrations of the ligand, determining the amount of
binding of the ligand to the immobilized polypeptide at each of the concentrations of ligand,
and calculating the apparent affinity of the binding based on that data. Typically, the
polypeptide comprising the PDZ domain and a non-PDZ domain is a fusion protein. In one
embodiment, the fusion protein is, e.g., a GST-PDZ fusion protein, but other polypeptides can
also be used (e.g., a fusion protein including a PDZ domain and any of a variety of epitope tags,
biotinylation signals and the like) so long as the polypeptide can be immobilized in an
orientation that does not abolish the ligand binding properties of the PDZ domain, e.g., by
tethering the polypeptide to the surface via the non-PDZ domain via an anti-domain antibody
and leaving the PDZ domain as the free end. It was discovered, for example, that reacting a
PDZ-GST fusion polypeptide directly to a plastic plate provided suboptimal results. The calculation
of binding affinity itself can be determined using any suitable equation (e.g., as shown supra;
also see Cantor and Schimmel (1980) Biophysical Chemistry, WH Freeman & Co., San
Francisco) or software.

Thus, in a preferred embodiment, the polypeptide is immobilized by binding the
polypeptide to an immobilized immunoglobulin that binds the non-PDZ domain (e.g., an
anti-GST antibody when a GST-PDZ fusion polypeptide is used). In a preferred embodiment, the
step of contacting the ligand and PDZ-domain polypeptide is carried out under the conditions
provided supra in the description of the "G" assay. It will be appreciated that binding assays
are conveniently carried out in multiwell plates (e.g., 24-well, 96-well, or 384-well
plates).

The present method has considerable advantages over other methods for
measuring PDZ-PL binding affinities, which typically involve contacting varying
concentrations of a GST-PDZ fusion protein to a ligand-coated surface. For example, some
previously described methods for determining affinity (e.g., using immobilized ligand and
GST-PDZ protein in solution) did not account for oligomerization state of the fusion proteins
used, resulting in potential errors of more than an order of magnitude.

Although not sufficient for quantitative measurement of PDZ-PL binding
affinity, an estimate of the relative strength of binding of different PDZ-PL pairs can be made
based on the absolute magnitude of the signals observed in the "G" assay. This estimate will
reflect several factors, including biologically relevant aspects of the interaction such as the
affinity and the dissociation rate. For comparisons of different ligands binding to a given PDZ
domain-containing protein, differences in absolute binding signal likely relate primarily to the
affinity and/or dissociation rate of the interactions of interest.

IX. Assays to Identify Novel PDZ Domain Binding Moieties and to Identify Modulators of 
PDZ Protein-PL Protein Binding 

Although described supra primarily in terms of identifying interactions between
PDZ-domain polypeptides and PL proteins, the assays described supra and other assays can
also be used to identify the binding of other molecules (e.g., peptide mimetics, small molecules,
and the like) to PDZ domain sequences. For example, using the assays disclosed herein,
combinatorial and other libraries of compounds can be screened, e.g., for molecules that
specifically bind to PDZ domains. Screening of libraries can be accomplished by any of a
variety of commonly known methods. See, e.g., the following references, which disclose
screening of peptide libraries: Parmley and Smith, 1989, Adv. Exp. Med. Biol. 251:215-218;
Scott and Smith, 1990, Science 249:386-390; Fowlkes et al., 1992, BioTechniques 13:422-427;
Oldenburg et al., 1992, Proc. Natl. Acad. Sci. USA 89:5393-5397; Yu et al., 1994, Cell
76:933-945; Staudt et al., 1988, Science 241:577-580; Bock et al., 1992, Nature 355:564-566;
Tuerk et al., 1992, Proc. Natl. Acad. Sci. USA 89:6988-6992; Ellington et al., 1992, Nature
355:850-852; U.S. Patent No. 5,096,815, U.S. Patent No. 5,223,409, and U.S. Patent No. 5,198,346, all
to Ladner et al.; Rebar and Pabo, 1993, Science 263:671-673; and PCT Publication No.
WO 94/18318.

In a specific embodiment, screening can be carried out by contacting the library
members with a PDZ-domain polypeptide immobilized on a solid support (e.g., as described
supra in the "G" assay) and harvesting those library members that bind to the protein.
Examples of such screening methods, termed "panning" techniques, are described by way of
example in Parmley and Smith, 1988, Gene 73:305-318; Fowlkes et al., 1992, BioTechniques
13:422-427; PCT Publication No. WO 94/18318; and in references cited hereinabove.

In another embodiment, the two-hybrid system for selecting interacting proteins
in yeast (Fields and Song, 1989, Nature 340:245-246; Chien et al., 1991, Proc. Natl. Acad. Sci.
USA 88:9578-9582) can be used to identify molecules that specifically bind to a PDZ
domain-containing protein. Furthermore, the identified molecules are further tested for their ability to
inhibit transmembrane receptor interactions with a PDZ domain.

In one aspect of the invention, antagonists of an interaction between a PDZ 
protein and a PL protein are identified. In one embodiment, a modification of the "A" assay 
described supra is used to identify antagonists. In one embodiment, a modification of the "G"
assay described supra is used to identify antagonists.

In one embodiment, screening assays are used to detect molecules that 
specifically bind to PDZ domains. Such molecules are useful as agonists or antagonists of 
PDZ-protein-mediated cell function (e.g., cell activation, e.g., T cell activation, vesicle 
transport, cytokine release, growth factors, transcriptional changes, cytoskeleton rearrangement, 
cell movement, chemotaxis, and the like). In one embodiment, such assays are performed to
screen for leukocyte activation inhibitors for drug development. The invention thus provides
assays to detect molecules that specifically bind to PDZ domain-containing proteins. For
example, recombinant cells expressing PDZ domain-encoding nucleic acids can be used to
produce PDZ domains in these assays and to screen for molecules that bind to the domains.
Molecules are contacted with the PDZ domain (or fragment thereof) under conditions
conducive to binding, and then molecules that specifically bind to such domains are identified. 
Methods that can be used to carry out the foregoing are commonly known in the art. 

It will be appreciated by the ordinarily skilled practitioner that, in one 
embodiment, antagonists are identified by conducting the A or G assays in the presence and 
absence of a known or candidate antagonist. When decreased binding is observed in the
presence of a compound, that compound is identified as an antagonist. Increased binding in the
presence of a compound signifies that the compound is an agonist. 

For example, in one assay, a test compound can be identified as an inhibitor
(antagonist) of binding between a PDZ protein and a PL protein by contacting a PDZ domain
polypeptide and a PL peptide or protein in the presence and absence of the test compound,
under conditions in which they would (but for the presence of the test compound) form a
complex, and detecting the formation of the complex in the presence and absence of the test
compound. It will be appreciated that less complex formation in the presence of the test
compound than in the absence of the compound indicates that the test compound is an inhibitor
of PDZ protein-PL protein binding.

In one embodiment, the "G" assay is used in the presence or absence of a
candidate inhibitor. In one embodiment, the "A" assay is used in the presence or absence of a
candidate inhibitor.

In one embodiment (in which a "G" assay is used), one or more PDZ domain-containing
GST-fusion proteins are bound to the surface of wells of a 96-well plate as described
supra (with appropriate controls including nonfusion GST protein). All fusion proteins are
bound in multiple wells so that appropriate controls and statistical analysis can be done. A test
compound in BSA/PBS (typically at multiple different concentrations) is added to wells.
Immediately thereafter, 30 µL of a detectably labeled (e.g., biotinylated) PL peptide or protein
known to bind to the relevant PDZ domain (see, e.g., TABLE 2) is added in each of the wells
at a final concentration of, e.g., between about 2 and about 40 µM, typically 5 µM, 15 µM,
or 25 µM. This mixture is then allowed to react with the PDZ fusion protein bound to the
surface for 10 minutes at 4°C followed by 20 minutes at 25°C. The surface is washed free of
unbound PL polypeptide three times with ice cold PBS and the amount of binding of the
polypeptide in the presence and absence of the test compound is determined. Usually, the level
of binding is measured for each set of replica wells (e.g., duplicates) by subtracting the mean
GST alone background from the mean of the raw measurement of polypeptide binding in these
wells.
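
The background subtraction described in the last sentence above can be sketched as follows. This is only an illustrative Python snippet: the function names and the duplicate-well readings are hypothetical, and the percent-of-control helper is an added convenience rather than a step stated in the text.

```python
from statistics import mean


def net_binding(raw_wells, gst_wells):
    """Mean raw signal of the replica wells minus the mean GST-alone background."""
    return mean(raw_wells) - mean(gst_wells)


def percent_of_control(net_with_compound, net_without_compound):
    """Binding with the test compound as a percentage of binding without it."""
    return 100.0 * net_with_compound / net_without_compound


# Hypothetical duplicate readings (arbitrary absorbance units).
control = net_binding([1.25, 1.75], [0.25, 0.25])  # no test compound
treated = net_binding([0.5, 0.5], [0.25, 0.25])    # with test compound
print(control, treated, percent_of_control(treated, control))
```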

In an alternative embodiment, the "A" assay is carried out in the presence or
absence of a test candidate to identify inhibitors of PL-PDZ interactions.

In one embodiment, a test compound is determined to be a specific inhibitor of
the binding of the PDZ domain (P) and a PL (L) sequence when, at a test compound
concentration of less than or equal to 1 mM (e.g., less than or equal to: 500 µM, 100 µM, 10
µM, 1 µM, 100 nM or 1 nM), the binding of P to L in the presence of the test compound is less
than about 50% of the binding in the absence of the test compound (in various embodiments,
less than about 25%, less than about 10%, or less than about 1%). Preferably, the net signal of
binding of P to L in the presence of the test compound plus six (6) times the standard error of
the signal in the presence of the test compound is less than the binding signal in the absence of
the test compound.

In one embodiment, assays for an inhibitor are carried out using a single PDZ
protein-PL protein pair (e.g., a PDZ domain fusion protein and a PL peptide or protein). In a
related embodiment, the assays are carried out using a plurality of pairs, such as a plurality of
different pairs listed in TABLE 2.
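
For any one PDZ-PL pair, the two inhibitor criteria given supra (binding below about 50% of the uninhibited level, and preferably a margin of six standard errors) could be checked along the following lines. This is a minimal sketch under stated assumptions: the inputs are taken to be background-subtracted net signals from replicate wells, the standard error is computed as the sample standard deviation divided by the square root of the number of replicates, and all names and example values are hypothetical.

```python
import math
from statistics import mean, stdev


def standard_error(values):
    """Standard error of the mean; needs at least two replicate values."""
    return stdev(values) / math.sqrt(len(values))


def inhibitor_checks(with_compound, without_compound, max_fraction=0.5, n_se=6):
    """Return (below_fraction, clear_margin) for replicate net binding signals.

    below_fraction: mean binding with compound < max_fraction * mean without.
    clear_margin:   mean with compound + n_se * SE(with) < mean without.
    """
    s_with = mean(with_compound)
    s_without = mean(without_compound)
    below_fraction = s_with < max_fraction * s_without
    clear_margin = s_with + n_se * standard_error(with_compound) < s_without
    return below_fraction, clear_margin


# Hypothetical triplicate net signals.
uninhibited = [1.0, 1.1, 0.9]
print(inhibitor_checks([0.20, 0.22, 0.18], uninhibited))
print(inhibitor_checks([0.1, 0.7, 0.4], uninhibited))
```

Returning the two checks separately keeps the "preferably" condition distinct from the basic 50% threshold, so a caller can decide how strictly to apply it.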

In some embodiments, it is desirable to identify compounds that, at a given
concentration, inhibit the binding of one PL-PDZ pair, but do not inhibit (or inhibit to a lesser
degree) the binding of a specified second PL-PDZ pair. These antagonists can be identified by
carrying out a series of assays using a candidate inhibitor and different PL-PDZ pairs (e.g., as
shown in the matrix of TABLE 2) and comparing the results of the assays. All such pairwise
combinations are contemplated by the invention (e.g., test compound inhibits binding of PL1 to
PDZ1 to a greater degree than it inhibits binding of PL1 to PDZ2 or PL2 to PDZ2). Importantly,
it will be appreciated that, based on the data provided in TABLE 2 and disclosed herein (and
additional data that can be generated using the methods described herein) inhibitors with
different specificities can readily be designed.

For example, according to the invention, the Ki ("potency") of an inhibitor of a
PDZ-PL interaction can be determined. Ki is a measure of the concentration of an inhibitor
required to have a biological effect. For example, administration of an inhibitor of a PDZ-PL
interaction in an amount sufficient to result in an intracellular inhibitor concentration of at least
between about 1 and about 100 Ki is expected to inhibit the biological response mediated by
the target PDZ-PL interaction. In one aspect of the invention, the Kd measurement of PDZ-PL
binding as determined using the methods supra is used in determining Ki.

Thus, in one aspect, the invention provides a method of determining the potency
(Ki) of an inhibitor or suspected inhibitor of binding between a PDZ domain and a ligand by
immobilizing a polypeptide comprising the PDZ domain and a non-PDZ domain on a surface,
contacting the immobilized polypeptide with a plurality of different mixtures of the ligand and
inhibitor, wherein the different mixtures comprise a fixed amount of ligand and different
concentrations of the inhibitor, determining the amount of ligand bound at the different
concentrations of inhibitor, and calculating the Ki of the binding based on the amount of ligand
bound in the presence of different concentrations of the inhibitor. In an embodiment, the
polypeptide is immobilized by binding the polypeptide to an immobilized immunoglobulin that
binds the non-PDZ domain. This method, which is based on the "G" assay described supra, is
particularly suited for high-throughput analysis of the Ki for inhibitors of PDZ-ligand
interactions. Further, using this method, the inhibition of the PDZ-ligand interaction itself is
measured, without distortion of measurements by avidity effects.

Typically, at least a portion of the ligand is detectably labeled to permit easy
quantitation of ligand binding.

It will be appreciated that the concentration of ligand and concentrations of
inhibitor are selected to allow meaningful detection of inhibition. Thus, the concentration of
the ligand whose binding is to be blocked is close to or less than its binding affinity (e.g.,
preferably less than 5x Kd of the interaction, more preferably less than 2x Kd, most
preferably less than 1x Kd). Thus, the ligand is typically present at a concentration of less than
2 Kd (e.g., between about 0.01 Kd and about 2 Kd) and the concentrations of the test inhibitor
typically range from 1 nM to 100 µM (e.g., a 4-fold dilution series with highest concentration
10 µM or 1 mM). In a preferred embodiment, the Kd is determined using the assay disclosed
supra.
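
The concentration choices described above can be sketched in a few lines of Python. This is an illustration only: the function names, the default number of dilution steps, and the example values are assumptions, not values from the specification.

```python
def dilution_series(highest, fold=4.0, steps=8):
    """Concentrations from `highest` downward, each `fold` times more dilute."""
    return [highest / fold ** i for i in range(steps)]


def ligand_concentration_in_range(ligand_conc, kd, low=0.01, high=2.0):
    """True if the fixed ligand concentration is at least low*Kd and below high*Kd."""
    return low * kd <= ligand_conc < high * kd


# Hypothetical example: inhibitor series starting at 10 µM, ligand at 5 µM, Kd of 10 µM.
print(dilution_series(10.0, steps=4))
print(ligand_concentration_in_range(5.0, kd=10.0))
```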

The Ki of the binding can be calculated by any of a variety of methods routinely
used in the art, based on the amount of ligand bound in the presence of different concentrations
of the inhibitor. In an illustrative embodiment, for example, a plot of labeled ligand binding
versus inhibitor concentration is fit to the equation:

$$S_{\text{inhibitor}} = S_0 \times \frac{K_i}{[I] + K_i}$$

where S_inhibitor is the signal of labeled ligand binding to immobilized PDZ domain in the
presence of inhibitor at concentration [I] and S_0 is the signal in the absence of inhibitor (i.e., [I]
= 0). Typically [I] is expressed as a molar concentration.
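
One simple way to extract Ki from such data, shown here only as a sketch, is to rearrange the equation above to S_0/S_inhibitor - 1 = [I]/Ki and fit a straight line through the origin. This linearized estimate is a stand-in for the nonlinear curve fit described in the text, not a reproduction of it, and it weights noisy points differently. The function names and the synthetic data are hypothetical.

```python
def inhibited_signal(inhibitor_conc, s0, ki):
    """Model from the text: S_inhibitor = S_0 * Ki / ([I] + Ki)."""
    return s0 * ki / (inhibitor_conc + ki)


def estimate_ki(inhibitor_conc, signal, s0):
    """Estimate Ki from the linearized model S_0/S - 1 = [I]/Ki.

    Fits a line through the origin to y = S_0/S - 1 versus x = [I];
    the slope is 1/Ki. Points with non-positive signal are skipped.
    Raises ZeroDivisionError if no inhibition at all is observed.
    """
    pairs = [(i, s0 / s - 1.0) for i, s in zip(inhibitor_conc, signal) if s > 0]
    sxx = sum(x * x for x, _ in pairs)
    sxy = sum(x * y for x, y in pairs)
    return sxx / sxy


# Synthetic, noise-free 4-fold dilution series (µM) with S_0 = 2.0 and Ki = 3.0 µM.
concs = [0.01 * 4 ** k for k in range(6)]
signals = [inhibited_signal(c, 2.0, 3.0) for c in concs]
ki_estimate = estimate_ki(concs, signals, 2.0)
print(round(ki_estimate, 6))
```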

In another aspect of the invention, an enhancer (sometimes referred to as an
augmentor or agonist) of binding between a PDZ domain and a ligand is identified by
immobilizing a polypeptide comprising the PDZ domain and a non-PDZ domain on a surface,
contacting the immobilized polypeptide with the ligand in the presence of a test agent and
determining the amount of ligand bound, and comparing the amount of ligand bound in the
presence of the test agent with the amount of ligand bound by the polypeptide in the absence of
the test agent. At least two-fold (often at least 5-fold) greater binding in the presence of the test
agent compared to the absence of the test agent indicates that the test agent is an agent that
enhances the binding of the PDZ domain to the ligand. As noted supra, agents that enhance
PDZ-ligand interactions are useful for disruption (dysregulation) of biological events requiring
normal PDZ-ligand function (e.g., cancer cell division and metastasis, and activation and
migration of immune cells).

The invention also provides methods for determining the "potency" or "K_enhancer"
of an enhancer of a PDZ-ligand interaction. For example, according to the invention, the
K_enhancer of an enhancer of a PDZ-PL interaction can be determined, e.g., using the Kd of
PDZ-PL binding as determined using the methods described supra. K_enhancer is a measure of the
concentration of an enhancer expected to have a biological effect. For example, administration
of an enhancer of a PDZ-PL interaction in an amount sufficient to result in an intracellular
enhancer concentration of at least between about 0.1 and about 100 K_enhancer (e.g., between about
0.5 and about 50 K_enhancer) is expected to disrupt the biological response mediated by the target
PDZ-PL interaction.

Thus, in one aspect the invention provides a method of determining the potency
(K_enhancer) of an enhancer or suspected enhancer of binding between a PDZ domain and a ligand
by immobilizing a polypeptide comprising the PDZ domain and a non-PDZ domain on a
surface, contacting the immobilized polypeptide with a plurality of different mixtures of the
ligand and enhancer, wherein the different mixtures comprise a fixed amount of ligand, at least
a portion of which is detectably labeled, and different concentrations of the enhancer,
determining the amount of ligand bound at the different concentrations of enhancer, and
calculating the potency (K_enhancer) of the enhancer from the binding based on the amount of
ligand bound in the presence of different concentrations of the enhancer. Typically, at least a
portion of the ligand is detectably labeled to permit easy quantitation of ligand binding. This
method, which is based on the "G" assay described supra, is particularly suited for
high-throughput analysis of the K_enhancer for enhancers of PDZ-ligand interactions.

It will be appreciated that the concentration of ligand and concentrations of
enhancer are selected to allow meaningful detection of enhanced binding. Thus, the ligand is
typically present at a concentration of between about 0.01 Kd and about 0.5 Kd and the
concentrations of the test agent/enhancer typically range from 1 nM to 1 mM (e.g., a 4-fold
dilution series with highest concentration 10 µM or 1 mM). In a preferred embodiment, the Kd
is determined using the assay disclosed supra.

The potency of the binding can be determined by a variety of standard methods
based on the amount of ligand bound in the presence of different concentrations of the enhancer
or augmentor. For example, a plot of labeled ligand binding versus enhancer concentration can
be fit to the equation:

$$S([E]) = S(0) + S(0) \times (D_{\text{enhancer}} - 1) \times \frac{[E]}{[E] + K_{\text{enhancer}}}$$

where "K_enhancer" is the potency of the augmenting compound, "D_enhancer" is the fold-increase
in binding of the labeled ligand obtained with addition of saturating amounts of the enhancing
compound, and [E] is the concentration of the enhancer. It will be understood that saturating
amounts are the amount of enhancer such that further addition does not significantly increase
the binding signal. Knowledge of "K_enhancer" is useful because it describes a concentration of the
augmenting compound in a target cell that will result in a biological effect due to dysregulation
of the PDZ-PL interaction. Typical therapeutic concentrations are between about 0.1 and about
100 K_enhancer.
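
To make the behavior of the enhancer equation concrete, the short Python sketch below evaluates it at a few concentrations and computes the stated concentration window. The function and parameter names are hypothetical and chosen only for illustration.

```python
def enhanced_signal(enhancer_conc, s0, d_enhancer, k_enhancer):
    """S([E]) = S(0) + S(0) * (D_enhancer - 1) * [E] / ([E] + K_enhancer)."""
    return s0 + s0 * (d_enhancer - 1.0) * enhancer_conc / (enhancer_conc + k_enhancer)


def therapeutic_range(k_enhancer, low=0.1, high=100.0):
    """Concentration window of about 0.1 to about 100 K_enhancer stated in the text."""
    return low * k_enhancer, high * k_enhancer


# With S(0) = 1, D_enhancer = 3 and K_enhancer = 2 (arbitrary units):
for conc in (0.0, 2.0, 1e9):
    print(conc, enhanced_signal(conc, 1.0, 3.0, 2.0))
```

At [E] = 0 the signal equals S(0), at [E] = K_enhancer it is halfway between S(0) and D_enhancer x S(0), and at saturating enhancer it approaches D_enhancer x S(0).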



X. Global Analysis of PDZ-PL Interactions 

As described supra, the present invention provides powerful methods for
analysis of PDZ-ligand interactions, including high-throughput methods such as the "G" assay
and affinity assays described supra. In one embodiment of the invention, the affinity is
determined for a particular ligand and a plurality of PDZ proteins. Typically the plurality is at
least 5, and often at least 25, or at least 40 different PDZ proteins. In a preferred embodiment,
the plurality of different PDZ proteins are from a particular tissue (e.g., central nervous system,
spleen, cardiac muscle, kidney) or a particular class or type of cell (e.g., a hematopoietic cell, a
lymphocyte, a neuron) and the like. In a most preferred embodiment, the plurality of different
PDZ proteins represents a substantial fraction (e.g., typically a majority, more often at least
80%) of all of the PDZ proteins known to be, or suspected of being, expressed in the tissue or
cell(s), e.g., all of the PDZ proteins known to be present in lymphocytes. In an embodiment,
the plurality is at least 50%, usually at least 80%, at least 90% or all of the PDZ proteins
disclosed herein as being expressed in hematopoietic cells.

In one embodiment of the invention, the binding of a ligand to the plurality of
PDZ proteins is determined. Using this method, it is possible to identify a particular PDZ
domain bound with particular specificity by the ligand. The binding may be designated as
"specific" if the affinity of the ligand to the particular PDZ domain is at least 2-fold that of the
binding to other PDZ domains in the plurality (e.g., present in that cell type). The binding is
deemed "very specific" if the affinity is at least 10-fold higher than to any other PDZ in the
plurality or, alternatively, at least 10-fold higher than to at least 90%, more often 95% of the
other PDZs in a defined plurality. Similarly, the binding is deemed "exceedingly specific" if it
is at least 100-fold higher. For example, a ligand could bind to 2 different PDZs with an
affinity of 1 µM and to no other PDZs out of a set of 40 with an affinity of less than 100 µM. This
would constitute specific binding to those 2 PDZs. Similar measures of specificity are used to
describe binding of a PDZ to a plurality of PLs.

It will be recognized that high specificity PDZ-PL interactions represent
potentially more valuable targets for achieving a desired biological effect. The ability of an
inhibitor or enhancer to act with high specificity is often desirable. In particular, the most
specific PDZ-ligand interactions are also the best therapeutic targets, allowing specific
inhibition of the interaction.
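
The specificity tiers defined supra (at least 2-fold, 10-fold, and 100-fold) can be expressed as a small classifier. The sketch below assumes affinities are given as Kd values, so that a lower Kd means higher affinity, and it compares the target only against the strongest competing interaction. It does not implement the alternative "at least 90% or 95% of the others" criterion. The function names and the "below 2-fold" label are hypothetical.

```python
def fold_selectivity(kd_target, kd_others):
    """Ratio of the tightest competing Kd to the target Kd (lower Kd = higher affinity)."""
    return min(kd_others) / kd_target


def classify_specificity(kd_target, kd_others):
    """Map fold selectivity onto the tiers named in the text."""
    fold = fold_selectivity(kd_target, kd_others)
    if fold >= 100:
        return "exceedingly specific"
    if fold >= 10:
        return "very specific"
    if fold >= 2:
        return "specific"
    return "below 2-fold"  # hypothetical label for anything under the lowest tier


# Example from the text: 1 µM to the PDZ of interest, 100 µM or weaker to the
# remaining 38 PDZs in a set of 40 (the second 1 µM binder is left out of the comparison).
print(classify_specificity(1.0, [100.0] * 38))
```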

Thus, in one embodiment, the invention provides a method of identifying a high
specificity interaction between a particular PDZ domain and a ligand known or suspected of
binding at least one PDZ domain, by providing a plurality of different immobilized
polypeptides, each of said polypeptides comprising a PDZ domain and a non-PDZ domain;
determining the affinity of the ligand for each of said polypeptides, and comparing the affinity
of binding of the ligand to each of said polypeptides, wherein an interaction between the ligand
and a particular PDZ domain is deemed to have high specificity when the ligand binds an
immobilized polypeptide comprising the particular PDZ domain with at least 2-fold higher
affinity than to immobilized polypeptides not comprising the particular PDZ domain.

In a related aspect, the affinity of binding of a specific PDZ domain to a plurality
of ligands (or suspected ligands) is determined. For example, in one embodiment, the invention
provides a method of identifying a high specificity interaction between a PDZ domain and a
particular ligand known or suspected of binding at least one PDZ domain, by providing an
immobilized polypeptide comprising the PDZ domain and a non-PDZ domain; determining the
affinity of each of a plurality of ligands for the polypeptide, and comparing the affinity of
binding of each of the ligands to the polypeptide, wherein an interaction between a particular
ligand and the PDZ domain is deemed to have high specificity when the ligand binds an
immobilized polypeptide comprising the PDZ domain with at least 2-fold higher affinity than
other ligands tested. Thus, the binding may be designated as "specific" if the affinity of the
PDZ to the particular PL is at least 2-fold that of the binding to other PLs in the plurality (e.g.,
present in that cell type). The binding is deemed "very specific" if the affinity is at least 10-fold
higher than to any other PL in the plurality or, alternatively, at least 10-fold higher than to at
least 90%, more often 95% of the other PLs in a defined plurality. Similarly, the binding is
deemed "exceedingly specific" if it is at least 100-fold higher. Typically the plurality is at least
5 different ligands, more often at least 10.

A. Use of Array for Global Predictions 

One discovery of the present inventors relates to the important and extensive
roles played by interactions between PDZ proteins and PL proteins, particularly in the
biological function of hematopoietic cells and other cells involved in the immune response.
Further, it has been discovered that valuable information can be ascertained by analysis (e.g.,
simultaneous analysis) of a large number of PDZ-PL interactions. In a preferred embodiment,
the analysis encompasses all of the PDZ proteins expressed in a particular tissue (e.g., spleen)
or type or class of cell (e.g., hematopoietic cell, neuron, lymphocyte, B cell, T cell and the like).
Alternatively, the analysis encompasses at least about 5, or at least about 10, or at least about
12, or at least about 15 and often at least 50 different polypeptides, up to about 60, about 80,
about 100, about 150, about 200, or even more different polypeptides; or a substantial fraction
(e.g., typically a majority, more often at least 80%) of all of the PDZ proteins known to be, or
suspected of being, expressed in the tissue or cell(s), e.g., all of the PDZ proteins known to be
present in lymphocytes.

It will be recognized that the arrays and methods of the invention are directed to
analysis of PDZ and PL interactions, and involve selection of such proteins for analysis. While
the devices and methods of the invention may include or involve a small number of control
polypeptides, they typically do not include significant numbers of proteins or fusion proteins
that do not include either PDZ or PL domains (e.g., typically, at least about 90% of the arrayed
or immobilized polypeptides in a method or device of the invention are PDZ or PL sequence
proteins, more often at least about 95%, or at least about 99%).

It will be apparent from this disclosure that analysis of the relatively large
number of different interactions preferably takes place simultaneously. In this context,
"simultaneously" means that several different PDZ-PL interactions (or the effects
of a test agent on such interactions) are assessed at the same time. Typically the analysis is
carried out in a high throughput (e.g., robotic) fashion. One advantage of this method of
simultaneous analysis is that it permits rigorous comparison of multiple different PDZ-PL
interactions. For example, as explained in detail elsewhere herein, simultaneous analysis (and
use of the arrays described infra) facilitates the direct comparison of the effect of
an agent (e.g., a potential interaction inhibitor) on the interactions between a substantial
portion of PDZs and/or PLs in a tissue or cell.

Accordingly, in one aspect, the invention provides an array of immobilized
polypeptides, each comprising a PDZ domain and a non-PDZ domain, on a surface. Typically, the
array comprises at least about 5, or at least about 10, or at least about 12, or at least about 15
and often at least 50 different polypeptides. In one preferred embodiment, the different PDZ
proteins are from a particular tissue (e.g., central nervous system, spleen, cardiac muscle,
kidney) or a particular class or type of cell (e.g., a hematopoietic cell, a lymphocyte, a neuron)
and the like. In a most preferred embodiment, the plurality of different PDZ proteins represents
a substantial fraction (e.g., typically a majority, more often at least 60%, 70% or 80%) of all of
the PDZ proteins known to be, or suspected of being, expressed in the tissue or cell(s), e.g., all
of the PDZ proteins known to be present in lymphocytes.

Certain embodiments are arrays which include a plurality, usually at least 5, 10,
25, or 50 PDZ proteins present in a particular cell of interest. In this context, "array" refers to an
ordered series of immobilized polypeptides in which the identity of each polypeptide is
associated with its location. In some embodiments the plurality of polypeptides are arrayed in a
"common" area such that they can be simultaneously exposed to a solution (e.g., containing a
ligand or test agent). For example, the plurality of polypeptides can be on a slide, plate or
similar surface, which may be plastic, glass, metal, silica, beads or other surface to which
proteins can be immobilized. In a different embodiment, the different immobilized
polypeptides are situated in separate areas, such as different wells of a multi-well plate (e.g., a
24-well plate, a 96-well plate, a 384-well plate, and the like). It will be recognized that a
similar advantage can be obtained by using multiple arrays in tandem.

B. Analysis of PDZ-PL Inhibition Profile 

In one aspect, the invention provides a method for determining if a test
compound inhibits any PDZ-ligand interaction in a large set of PDZ-ligand interactions (e.g., a
plurality of the PDZ-ligand interactions described in Table 2; a majority of the PDZ-ligand
interactions identified in a particular cell or tissue as described supra (e.g., lymphocytes); and
the like). In one embodiment, the PDZ domains of interest are expressed as GST-PDZ fusion
proteins and immobilized as described herein. For each PDZ domain, a labeled ligand that binds
to the domain with a known affinity is identified as described herein.

For any known or suspected modulator (e.g., inhibitor) of a PDZ-PL
interaction(s), it is useful to know which interactions are inhibited (or augmented). For
example, an agent that inhibits all PDZ-PL interactions in a cell (e.g., a lymphocyte) will have
different uses than an agent that inhibits only one, or a small number, of specific PDZ-PL
interactions. The profile of PDZ interactions inhibited by a particular agent is referred to as the
"inhibition profile" for the agent, and is described in detail below. The profile of PDZ
interactions enhanced by a particular agent is referred to as the "enhancement profile" for the
agent. It will be readily apparent to one of skill guided by the description of the inhibition
profile how to determine the enhancement profile for an agent. The present invention provides
methods for determining the PDZ interaction (inhibition/enhancement) profile of an agent in a
single assay.

In one aspect, the invention provides a method for determining the PDZ-PL
inhibition profile of a compound by providing (i) a plurality of different immobilized
polypeptides, each of said polypeptides comprising a PDZ domain and a non-PDZ domain and
(ii) a plurality of corresponding ligands, wherein each ligand binds at least one PDZ domain in
(i), then contacting each of said immobilized polypeptides in (i) with a corresponding ligand in
(ii) in the presence and absence of a test compound, and determining for each polypeptide-ligand
pair whether the test compound inhibits binding between the immobilized polypeptide
and the corresponding ligand.

Typically the plurality is at least 5, and often at least 25, or at least 40 different 
PDZ proteins. In a preferred embodiment, the plurality of different ligands and the plurality of 
different PDZ proteins are from the same tissue or a particular class or type of cell, e.g., a 
hematopoietic cell, a lymphocyte, a neuron and the like. In a most preferred embodiment, the
plurality of different PDZs represents a substantial fraction (e.g., at least 80%) of all of the
PDZs known to be, or suspected of being, expressed in the tissue or cell(s), e.g., all of the PDZs 
known to be present in lymphocytes (for example, at least 80%, at least 90% or all of the PDZs 
disclosed herein as being expressed in hematopoietic cells). 

In one embodiment, the inhibition profile is determined as follows: A plurality
of (e.g., all known) PDZ domains expressed in a cell (e.g., lymphocytes) are expressed as GST-fusion
proteins and immobilized without altering their ligand binding properties as described
supra. For each PDZ domain, a labeled ligand that binds to this domain with a known affinity
is identified. If the set of PDZ domains expressed in lymphocytes is denoted by {P1 ... Pn}, any
given PDZ domain Pi binds a (labeled) ligand Li with affinity Kd_i. To determine the inhibition
profile for a test agent "compound X" the "G" assay (supra) can be performed as follows in 96-well
plates with rows A-H and columns 1-12. Column 1 is coated with P1 and washed. The
corresponding ligand L1 is added to each washed coated well of column 1 at a concentration of
0.5 Kd_1 with (rows B, D, F, H) or without (rows A, C, E, G) test compound X (between about 1
and about 1000 µM). Column 2 is coated with P2, and L2 (at a concentration of 0.5 Kd_2) is
added with or without inhibitor X. Additional PDZ domains and ligands are similarly tested.

Compound X is considered to inhibit the binding of Li to Pi if the average signal
in the wells of column i containing X is less than half the average signal in the wells of the
same column lacking X. Thus, in this single assay one determines the full set of lymphocyte PDZs
that are inhibited by compound X.

In some embodiments, the test compound X is a mixture of compounds, such as
the product of a combinatorial chemistry synthesis as described supra. In some embodiments,
the test compound is known to have a desired biological effect, and the assay is used to
determine the mechanism of action (i.e., if the biological effect is due to modulating a PDZ-PL
interaction).
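
The plate layout and the half-signal criterion described above can be sketched as follows. This is an illustrative reading only: the function and variable names and the signal values are hypothetical, and the rows lacking X are taken to be A, C, E and G (the alternating rows not assigned to compound X).

```python
from statistics import mean

# Row assignment per column; rows lacking X assumed to be A, C, E, G.
ROWS_WITH_X = ("B", "D", "F", "H")
ROWS_WITHOUT_X = ("A", "C", "E", "G")


def inhibition_profile(plate, pdz_by_column, threshold=0.5):
    """Return {PDZ name: True if compound X inhibits its ligand binding}.

    plate maps (row, column) -> measured signal for one 96-well plate;
    pdz_by_column maps a column number to the PDZ domain coated in it.
    """
    profile = {}
    for column, pdz in pdz_by_column.items():
        with_x = mean(plate[(row, column)] for row in ROWS_WITH_X)
        without_x = mean(plate[(row, column)] for row in ROWS_WITHOUT_X)
        # Inhibited when the average signal with X is under half of that without X.
        profile[pdz] = with_x < threshold * without_x
    return profile


# Hypothetical signals: X suppresses binding in column 1 but not in column 2.
plate = {}
for row in ROWS_WITHOUT_X:
    plate[(row, 1)] = 1.0
    plate[(row, 2)] = 1.0
for row in ROWS_WITH_X:
    plate[(row, 1)] = 0.2
    plate[(row, 2)] = 0.9

print(inhibition_profile(plate, {1: "P1", 2: "P2"}))
```

The returned dictionary is one possible representation of the "inhibition profile" for compound X across the tested columns.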

It will be apparent that an agent that modulates only one, or a few PDZ-PL
interactions, in a panel (e.g., a panel of all known lymphocyte PDZs, a panel of at least 10, at
least 20 or at least 50 PDZ domains) is a more specific modulator than an agent that modulates
many or most interactions. Typically, an agent that modulates less than 20% of PDZ domains in
a panel (e.g., Table 2) is deemed a "specific" inhibitor, less than 6% a "very specific" inhibitor,
and a single PDZ domain a "maximally specific" inhibitor.
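
These panel thresholds can be applied to an inhibition profile as in the sketch below. The function name and the labels for the no-hit and non-specific cases are assumptions, as is the order of the checks (a single modulated PDZ is reported as "maximally specific" regardless of panel size).

```python
def inhibitor_specificity(profile):
    """Grade an agent from a profile mapping PDZ name -> True if modulated."""
    total = len(profile)
    if total == 0:
        raise ValueError("panel is empty")
    hits = sum(1 for modulated in profile.values() if modulated)
    if hits == 0:
        return "no modulation detected"   # label not from the text
    if hits == 1:
        return "maximally specific"
    fraction = hits / total
    if fraction < 0.06:
        return "very specific"
    if fraction < 0.20:
        return "specific"
    return "not specific"                 # label not from the text


# Hypothetical 50-member panel in which the agent modulates 2 PDZ domains.
panel = {f"PDZ{i}": i < 2 for i in range(50)}
print(inhibitor_specificity(panel))
```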

It will also be appreciated that "compound X" may be a composition containing
a mixture of compounds (e.g., generated using combinatorial chemistry methods) rather than a
single compound.

Several variations of this assay are contemplated:

In some alternative embodiments, the assay above is performed using varying
concentrations of the test compound X, rather than a fixed concentration. This allows
determination of the Ki of X for each PDZ as described above.

In an alternative embodiment, instead of pairing each PDZ Pi with a specific
labeled ligand Li, a mixture of different labeled ligands is created such that for every PDZ
at least one of the ligands in the mixture binds to this PDZ sufficiently to detect the binding in
the "G" assay. This mixture is then used for every PDZ domain.

In one embodiment, compound X is known to have a desired biological effect,
but the chemical mechanism by which it has that effect is unknown. The assays of the invention
can then be used to determine if compound X has its effect by binding to a PDZ domain.

In one embodiment, PDZ-domain containing proteins are classified into groups
based on their biological function, e.g., into those that regulate chemotaxis versus those that
regulate transcription. An optimal inhibitor of a particular function (e.g., including but not
limited to an anti-chemotactic agent, an anti-T cell activation agent, cell-cycle control, vesicle
transport, apoptosis, etc.) will inhibit multiple PDZ-ligand interactions involved in the function
(e.g., chemotaxis, activation) but few other interactions. Thus, the assay is used in one
embodiment in screening and design of a drug that specifically blocks a particular function.
For example, an agent designed to block chemotaxis might be identified because, at a given
concentration, the agent inhibits 2 or more PDZs involved in chemotaxis but fewer than 3 other
PDZs, or because it inhibits PDZs involved in chemotaxis with a Ki more than 10-fold better than
for other PDZs. Thus, the invention provides a method for identifying an agent that inhibits a first
selected PDZ-PL interaction or plurality of interactions but does not inhibit a second selected
PDZ-PL interaction or plurality of interactions. The two (or more) sets of interactions can be
selected on the basis of the known biological function of the PDZ proteins, the tissue specificity
of the PDZ proteins, or any other criteria. Moreover, the assay can be used to determine
effective doses (i.e., drug concentrations) that result in desired biological effects while avoiding
undesirable effects.
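
The two example criteria for a function-selective agent (a count of inhibited PDZs, or a Ki margin) might be checked as in this sketch. All names and values are hypothetical, and the Ki test assumes that "better" means a lower Ki and that the weakest on-target Ki is compared with the strongest off-target Ki.

```python
def is_function_selective(inhibited, function_pdzs, min_on_target=2, max_off_target=3):
    """True if at least `min_on_target` PDZs of the function are inhibited
    while fewer than `max_off_target` other PDZs are inhibited."""
    inhibited = set(inhibited)
    function_pdzs = set(function_pdzs)
    on_target = inhibited & function_pdzs
    off_target = inhibited - function_pdzs
    return len(on_target) >= min_on_target and len(off_target) < max_off_target


def is_ki_selective(ki_by_pdz, function_pdzs, fold=10.0):
    """True if every on-target Ki is more than `fold` lower than every other Ki."""
    on = [ki for pdz, ki in ki_by_pdz.items() if pdz in function_pdzs]
    off = [ki for pdz, ki in ki_by_pdz.items() if pdz not in function_pdzs]
    if not on or not off:
        raise ValueError("need Ki values for both on-target and other PDZs")
    return max(on) * fold < min(off)


# Hypothetical chemotaxis panel; Ki values in uM.
chemotaxis = {"P1", "P2"}
print(is_function_selective({"P1", "P2", "P7"}, chemotaxis))
print(is_ki_selective({"P1": 1.0, "P2": 2.0, "P3": 50.0}, chemotaxis))
```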

C. Side Effects of PDZ-PL Modulator Interactions

In a related embodiment, the invention provides a method for determining likely
side effects of a therapeutic that inhibits PDZ-ligand interactions. The method entails
identifying those target tissues, organs or cell types that express PDZ proteins and ligands whose
interactions are disrupted by a specified inhibitor. If, at a therapeutic dosage, a drug intended to
have an effect in one organ system (e.g., hematopoietic system) disrupts PDZ-PL interactions in a
different system (e.g., CNS) it can be predicted that the drug will have effects ("side effects")
on the second system. It will be apparent that the information obtained from this assay will be
useful in the rational design and selection of drugs that do not have the side effect.

In one embodiment, for example, a comprehensive PDZ protein set is obtained.
A "perfectly comprehensive" PDZ protein set is defined as the set of all PDZ proteins
expressed in the subject animal (e.g., humans). A comprehensive set may be obtained by
analysis of, for example, the human genome sequence. However, a "perfectly comprehensive"
set is not required and any reasonably large set of PDZ domain proteins (e.g., the set of all
known PDZ proteins; or the set listed in TABLE 6) will provide valuable information.

In one embodiment, the method involves some or all of the following steps:

a) For each PDZ protein, determine the tissues in which it is highly
expressed. This can be done experimentally although the information generally will be
available in the scientific literature;

b) For each PDZ protein (or as many as possible), identify the cognate
PL(s) bound by the PDZ protein;

c) Determine the Ki at which the test agent inhibits each PDZ-PL
interaction, using the methods described supra;

d) From this information it is possible to calculate the pattern of PDZ-PL
interactions disrupted at various concentrations of the test agent.

By correlating the set of PDZ-PL interactions disrupted with the expression pattern of the
members of that set, it will be possible to identify the tissues likely affected by the agent.
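
Steps a) through d) and the final correlation can be sketched as a lookup over two tables. The data, names, and the rule that an interaction counts as disrupted when the agent concentration is at or above its Ki are illustrative assumptions, not values or criteria taken from the disclosure.

```python
def disrupted_interactions(ki_by_pdz, concentration):
    """Return the PDZ proteins whose PL interaction is treated as disrupted.

    Assumption (not from the text): an interaction counts as disrupted when
    the agent concentration is at or above its Ki, in the same units.
    """
    return {pdz for pdz, ki in ki_by_pdz.items() if concentration >= ki}


def affected_tissues(ki_by_pdz, expression, concentration):
    """Map each tissue to the disrupted PDZ proteins highly expressed there."""
    tissues = {}
    for pdz in disrupted_interactions(ki_by_pdz, concentration):
        for tissue in expression.get(pdz, ()):
            tissues.setdefault(tissue, set()).add(pdz)
    return tissues


# Hypothetical inputs: Ki values in uM (step c) and expression data (step a).
ki_by_pdz = {"PDZ-A": 2.0, "PDZ-B": 40.0, "PDZ-C": 500.0}
expression = {
    "PDZ-A": {"lymphocyte"},
    "PDZ-B": {"lymphocyte", "CNS"},
    "PDZ-C": {"kidney"},
}

# Step d: the pattern of affected tissues at several agent concentrations.
for dose in (1.0, 10.0, 100.0):
    print(dose, affected_tissues(ki_by_pdz, expression, dose))
```

In this hypothetical data set, a tissue outside the intended target (here "CNS") appearing at a given dose would flag a possible side effect at that dose.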

Additional steps can also be carried out, including determining whether a
specified tissue or cell type is exposed to an agent following a particular route of
administration. This can be determined using basic pharmacokinetic methods and principles.

D. Modulation of Activities 

The PDZ binding moieties and PDZ protein-PL protein binding antagonists of
the invention are used to modulate biological activities or functions of cells (e.g., hematopoietic
cells such as T cells and B cells and the like, endothelial cells, and other immune system cells),
as described herein, and for treatment of diseases and conditions in human and nonhuman
animals (e.g., experimental models). Exemplary biological activities are listed supra.

When administered to patients, the compounds of the invention (e.g., PL-PDZ
interaction inhibitors) are useful for treating (ameliorating symptoms of) a variety of diseases
and conditions, including diseases characterized by inflammatory and humoral immune
responses, e.g., inflammation, allergy (e.g., systemic anaphylaxis, hypersensitivity responses,
drug allergies, insect sting allergies); inflammatory bowel diseases, ulcerative colitis, ileitis and
enteritis; psoriasis and inflammatory dermatoses, scleroderma; respiratory allergic diseases
such as asthma, allergic rhinitis, hypersensitivity lung diseases, and the like; vasculitis, Rh
incompatibility, transfusion reactions, drug sensitivities, PIH, atopic dermatitis, eczema,
rhinitis; autoimmune diseases, such as arthritis (rheumatoid and psoriatic), multiple sclerosis,
systemic lupus erythematosus, insulin-dependent diabetes, glomerulonephritis, scleroderma,
MCTD, IDDM, Hashimoto thyroiditis, Goodpasture syndrome, psoriasis and the like;
osteoarthritis, polyarthritis, graft rejection (e.g., allograft rejection, e.g., renal allograft
rejection, graft-vs-host disease, transplantation rejection (cardiac, kidney, lung, liver, small
bowel, cornea, pancreas, cadaver, autologous, bone marrow, xenotransplantation)),
atherosclerosis, angiogenesis-dependent disorders, cancers (e.g., melanomas and breast cancer,
prostate cancer, leukemias, lymphomas, metastatic disease), infectious diseases (e.g., viral
infection, such as HIV, measles, parainfluenza, virus-mediated cell fusion), ischemia (e.g.,
post-myocardial infarction complications, joint injury, kidney, scleroderma).



E. Agonists and Antagonists of PDZ-PL Interactions 

As described herein, interactions between PDZ proteins and PL proteins in cells
(e.g., hematopoietic cells, e.g., T cells and B cells) may be disrupted or inhibited by the
administration of inhibitors or antagonists. Inhibitors can be identified using screening assays
described herein. In one embodiment, the motifs disclosed herein are used to design inhibitors. In
some embodiments, the antagonists of the invention have a structure (e.g., peptide sequence)
based on the C-terminal residues of PL-domain proteins listed in TABLE 3. In some
embodiments, the antagonists of the invention have a structure (e.g., peptide sequence) based
on a PL motif disclosed herein.

The PDZ/PL agonists and antagonists of the invention may be any of a large
variety of compounds, both naturally occurring and synthetic, organic and inorganic, and
including polymers (e.g., oligopeptides, polypeptides, oligonucleotides, and polynucleotides),
small molecules, antibodies, sugars, fatty acids, nucleotides and nucleotide analogs, analogs of
naturally occurring structures (e.g., peptide mimetics, nucleic acid analogs, and the like), and
numerous other compounds. Although, for convenience, the present discussion primarily refers
to antagonists of PDZ-PL interactions, it will be recognized that PDZ-PL interaction agonists can
also be used in the methods disclosed herein.

In one aspect, the peptides and peptide mimetics or analogues of the invention
contain an amino acid sequence that binds a PDZ domain in a cell of interest. In one
embodiment, the antagonists comprise a peptide that has a sequence corresponding to the
carboxy-terminal sequence of a PL protein listed in TABLE 3, e.g., a peptide listed in TABLE 3.
Typically, the peptide comprises at least the C-terminal two (2), three (3) or four (4) residues
of the PL protein, and often the inhibitory peptide comprises more than four residues (e.g., at
least five, six, seven, eight, nine, ten, twelve or fifteen residues) from the PL protein
C-terminus.

In some embodiments, the inhibitor is a peptide, e.g., having a sequence of a PL
C-terminal protein sequence.

In some embodiments, the antagonist is a fusion protein comprising such a
sequence. Fusion proteins containing a transmembrane transporter amino acid sequence are
particularly useful.

In some embodiments, the inhibitor is a conserved variant of the PL C-terminal
protein sequence having inhibitory activity.

In some embodiments, the antagonist is a peptide mimetic of a PL C-terminal
sequence.

In some embodiments, the inhibitor is a small molecule (i.e., having a molecular
weight less than 1 kD). See, e.g., Section 6.5.4, infra.

F. Peptide Antagonists

In one embodiment, the antagonists comprise a peptide that has a sequence of a
PL protein carboxy-terminus listed in TABLE 3. The peptide comprises at least the C-terminal
two (2) residues of the PL protein, and typically, the inhibitory peptide comprises more than
two residues (e.g., at least three, four, five, six, seven, eight, nine, ten, twelve or fifteen residues)
from the PL protein C-terminus. The peptide may be any of a variety of lengths (e.g., at least 2,
at least 3, at least 4, at least 5, at least 6, at least 8, at least 10, or at least 20 residues) and may
contain additional residues not from the PL protein. It will be recognized that short PL peptides
are sometimes used in the rational design of other small molecules with similar properties.

Most often, the residues shared by the inhibitory peptide with the PL
protein are found at the C-terminus of the peptide. However, in some embodiments, the
sequence is internal. Similarly, in some cases, the inhibitory peptide comprises residues from a
PL sequence that is near, but not at, the C-terminus of a PL protein (see Gee et al., 1998, J.
Biol. Chem. 273:21980-87).

Sometimes the PL protein carboxy-terminus sequence is referred to as the "core
PDZ motif sequence" referring to the ability of the short sequence to interact with the PDZ
domain. For example, in an embodiment, the "core PDZ motif sequence" contains the last four
C-terminal amino acids. As described above, the four amino acid core of a PDZ motif
sequence may contain additional amino acids at its amino terminus to further increase its
binding affinity and/or stability. Thus, in one embodiment, the PDZ motif sequence peptide
can be from four amino acids up to 15 amino acids. Preferably, the length of the
sequence is 6-10 amino acids. More preferably, the PDZ motif sequence contains 8 amino
acids. Additional amino acids at the amino terminal end of the core sequence may be derived
from the natural sequence in each hematopoietic cell surface receptor or a synthetic linker. The
additional amino acids may also be conservatively substituted. When the third residue from the
C-terminus is S, T or Y, this residue may be phosphorylated prior to the use of the peptide.
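
As a rough illustration of the length ranges above, candidate motif peptides can be enumerated from the C-terminus of a PL protein sequence. The sequence below is invented for the example and is not taken from TABLE 3; the function names are likewise hypothetical.

```python
def pdz_motif_peptides(protein_seq, min_len=4, max_len=15):
    """Return C-terminal peptides of min_len..max_len residues, shortest first.

    The shortest peptide is the four-residue core; longer ones extend it
    toward the amino terminus using the protein's own sequence.
    """
    seq = protein_seq.strip().upper()
    if len(seq) < min_len:
        raise ValueError("sequence shorter than the core motif")
    longest = min(max_len, len(seq))
    return [seq[-n:] for n in range(min_len, longest + 1)]


def phosphorylatable(peptide):
    """True when the third residue from the C-terminus is S, T or Y."""
    return len(peptide) >= 3 and peptide[-3] in "STY"


# Hypothetical PL protein C-terminal region (one-letter amino acid codes).
example_seq = "MAAGKLESTDV"
peptides = pdz_motif_peptides(example_seq)
core = peptides[0]        # four-residue core
preferred = peptides[4]   # eight-residue peptide, the most preferred length
print(core, preferred, phosphorylatable(core))
```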

In some embodiments, the peptide and nonpeptide inhibitors of the invention are small,
e.g., fewer than ten amino acid residues in length if a peptide. Further, it is reported that a
limited number of ligand amino acids directly contact the PDZ domain (generally fewer than
eight) (Kozlov et al., 2000, Biochemistry 39, 2572; Doyle et al., 1996, Cell 85, 1067) and that
peptides as short as the C-terminal three amino acids often retain binding properties similar to
those of longer (> 15 amino acids) peptides (Yanagisawa et al., 1997, J. Biol. Chem. 272, 8539).

G. Peptide Variants

Having identified PDZ binding peptides and PDZ-PL interaction inhibitory
sequences, variations of these sequences can be made and the resulting peptide variants can be
tested for PDZ domain binding or PDZ-PL inhibitory activity. In some embodiments, the variants
have the same or a different ability to bind a PDZ domain as the parent peptide. Typically,
such amino acid substitutions are conservative, i.e., the amino acid residues are replaced with
other amino acid residues having physical and/or chemical properties similar to the residues
they are replacing. Preferably, conservative amino acid substitutions are those wherein an
amino acid is replaced with another amino acid encompassed within the same designated class.

H. Peptide Mimetics

Having identified PDZ binding peptides and PDZ-PL interaction inhibitory
sequences, peptide mimetics can be prepared using routine methods, and the inhibitory activity
of the mimetics can be confirmed using the assays of the invention. Thus, in some
embodiments, the antagonist is a peptide mimetic of a PL C-terminal sequence. The skilled
artisan will recognize that individual synthetic residues and polypeptides incorporating
mimetics can be synthesized using a variety of procedures and methodologies, which are well
described in the scientific and patent literature, e.g., Organic Syntheses Collective Volumes,
Gilman et al. (Eds) John Wiley & Sons, Inc., NY. Polypeptides incorporating mimetics can
also be made using solid phase synthetic procedures, as described, e.g., by Di Marchi, et al.,
U.S. Pat. No. 5,422,426. Mimetics of the invention can also be synthesized using
combinatorial methodologies. Various techniques for generation of peptide and
peptidomimetic libraries are well known, and include, e.g., multipin, tea bag, and
split-couple-mix techniques; see, e.g., al-Obeidi (1998) Mol. Biotechnol. 9:205-223; Hruby
(1997) Curr. Opin. Chem. Biol. 1:114-119; Ostergaard (1997) Mol. Divers. 3:17-27; Ostresh
(1996) Methods Enzymol. 267:220-234.

I. Small Molecules 

In some embodiments, the inhibitor is a small molecule (i.e., having a molecular 
weight less than 1 kD). Methods for screening small molecules are well known in the art and 
include those described supra. 


XII. Preparation of Peptides 

A. Chemical Synthesis

The peptides of the invention or analogues thereof may be prepared using
virtually any art-known technique for the preparation of peptides and peptide analogues. For
example, the peptides may be prepared in linear form using conventional solution or solid
phase peptide syntheses and cleaved from the resin followed by purification procedures
(Creighton, 1983, Protein Structures And Molecular Principles, W.H. Freeman and Co., N.Y.).
Suitable procedures for synthesizing the peptides described herein are well known in the art.
The composition of the synthetic peptides may be confirmed by amino acid analysis or
sequencing (e.g., the Edman degradation procedure and mass spectroscopy).

In addition, analogues and derivatives of the peptides can be chemically
synthesized. The linkage between each amino acid of the peptides of the invention may be an
amide, a substituted amide or an isostere of amide. Nonclassical amino acids or chemical
amino acid analogues can be introduced as a substitution or addition into the sequence.
Non-classical amino acids include, but are not limited to, the D-isomers of the common amino acids,
α-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, γ-Abu, ε-Ahx,
6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine,
norleucine, norvaline, hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine,
t-butylalanine, phenylglycine, cyclohexylalanine, β-alanine, fluoro-amino acids, designer amino
acids such as β-methyl amino acids, Cα-methyl amino acids, Nα-methyl amino acids, and
amino acid analogues in general. Furthermore, the amino acid can be D (dextrorotary) or L
(levorotary).

B. Recombinant Synthesis 

If the peptide is composed entirely of gene-encoded amino acids, or a portion of
it is so composed, the peptide or the relevant portion may also be synthesized using
conventional recombinant genetic engineering techniques. For recombinant production, a
polynucleotide sequence encoding a linear form of the peptide is inserted into an appropriate
expression vehicle, i.e., a vector which contains the necessary elements for the transcription and
translation of the inserted coding sequence, or in the case of an RNA viral vector, the necessary
elements for replication and translation. The expression vehicle is then transfected into a
suitable target cell which will express the peptide. Depending on the expression system used,
the expressed peptide is then isolated by procedures well-established in the art. Methods for
recombinant protein and peptide production are well known in the art (see, e.g., Maniatis et al.,
1989, Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y.; and
Ausubel et al., 1989, Current Protocols in Molecular Biology, Greene Publishing Associates
and Wiley Interscience, N.Y.).

A variety of host-expression vector systems may be utilized to express the
peptides described herein. These include, but are not limited to, microorganisms such as
bacteria transformed with recombinant bacteriophage DNA or plasmid DNA expression vectors
containing an appropriate coding sequence; yeast or filamentous fungi transformed with
recombinant yeast or fungi expression vectors containing an appropriate coding sequence;
insect cell systems infected with recombinant virus expression vectors (e.g., baculovirus)
containing an appropriate coding sequence; plant cell systems infected with recombinant virus
expression vectors (e.g., cauliflower mosaic virus or tobacco mosaic virus) or transformed with
recombinant plasmid expression vectors (e.g., Ti plasmid) containing an appropriate coding
sequence; or animal cell systems.

The expression elements of the expression systems vary in their strength and
specificities. Depending on the host/vector system utilized, any of a number of suitable
transcription and translation elements, including constitutive and inducible promoters, may be
used in the expression vector. For example, when cloning in bacterial systems, inducible
promoters such as pL of bacteriophage λ, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the
like may be used; when cloning in insect cell systems, promoters such as the baculovirus
polyhedrin promoter may be used; when cloning in plant cell systems, promoters derived from
the genome of plant cells (e.g., heat shock promoters; the promoter for the small subunit of
RUBISCO; the promoter for the chlorophyll a/b binding protein) or from plant viruses (e.g., the
35S RNA promoter of CaMV; the coat protein promoter of TMV) may be used; when cloning
in mammalian cell systems, promoters derived from the genome of mammalian cells (e.g.,
metallothionein promoter) or from mammalian viruses (e.g., the adenovirus late promoter; the
vaccinia virus 7.5 K promoter) may be used; when generating cell lines that contain multiple
copies of expression product, SV40-, BPV- and EBV-based vectors may be used with an
appropriate selectable marker.

In cases where plant expression vectors are used, the expression of sequences
encoding the peptides of the invention may be driven by any of a number of promoters. For
example, viral promoters such as the 35S RNA and 19S RNA promoters of CaMV (Brisson et
al., 1984, Nature 310:511-514), or the coat protein promoter of TMV (Takamatsu et al., 1987,
EMBO J. 6:307-311) may be used; alternatively, plant promoters such as the small subunit of
RUBISCO (Coruzzi et al., 1984, EMBO J. 3:1671-1680; Broglie et al., 1984, Science 224:838-843)
or heat shock promoters, e.g., soybean hsp17.5-E or hsp17.3-B (Gurley et al., 1986, Mol.
Cell Biol. 6:559-565) may be used. These constructs can be introduced into plant cells
using Ti plasmids, Ri plasmids, plant virus vectors, direct DNA transformation, microinjection,
electroporation, etc. For reviews of such techniques see, e.g., Weissbach & Weissbach, 1988,
Methods for Plant Molecular Biology, Academic Press, NY, Section VIII, pp. 421-463; and
Grierson & Corey, 1988, Plant Molecular Biology, 2d Ed., Blackie, London, Ch. 7-9.

In one insect expression system that may be used to produce the peptides of the
invention, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a vector to
express the foreign genes. The virus grows in Spodoptera frugiperda cells. A coding sequence
may be cloned into non-essential regions (for example the polyhedrin gene) of the virus and
placed under control of an AcNPV promoter (for example, the polyhedrin promoter).
Successful insertion of a coding sequence will result in inactivation of the polyhedrin gene and
production of non-occluded recombinant virus (i.e., virus lacking the proteinaceous coat coded
for by the polyhedrin gene). These recombinant viruses are then used to infect Spodoptera
frugiperda cells in which the inserted gene is expressed (e.g., see Smith et al., 1983, J. Virol.
46:584; Smith, U.S. Patent No. 4,215,051). Further examples of this expression system may be
found in Current Protocols in Molecular Biology, Vol. 2, Ausubel et al., eds., Greene Publish.
Assoc. & Wiley Interscience.

In mammalian host cells, a number of viral based expression systems may be
utilized. In cases where an adenovirus is used as an expression vector, a coding sequence may
be ligated to an adenovirus transcription/translation control complex, e.g., the late promoter and
tripartite leader sequence. This chimeric gene may then be inserted in the adenovirus genome
by in vitro or in vivo recombination. Insertion in a non-essential region of the viral genome
(e.g., region E1 or E3) will result in a recombinant virus that is viable and capable of expressing
peptide in infected hosts (e.g., see Logan & Shenk, 1984, Proc. Natl. Acad. Sci. USA
81:3655-3659). Alternatively, the vaccinia 7.5 K promoter may be used (see, e.g., Mackett et
al., 1982, Proc. Natl. Acad. Sci. USA 79:7415-7419; Mackett et al., 1984, J. Virol. 49:857-864;
Panicali et al., 1982, Proc. Natl. Acad. Sci. USA 79:4927-4931).

Other expression systems for producing linear peptides of the invention will be 
apparent to those having skill in the art. 

Purification of the Peptides and Peptide Analogues

The peptides and peptide analogues of the invention can be purified by art-known
techniques such as high performance liquid chromatography, ion exchange
chromatography, gel electrophoresis, affinity chromatography and the like. The actual
conditions used to purify a particular peptide or analogue will depend, in part, on factors such
as net charge, hydrophobicity, hydrophilicity, etc., and will be apparent to those having skill in
the art. The purified peptides can be identified by assays based on their physical or functional
properties, including radioactive labeling followed by gel electrophoresis, radioimmunoassays,
ELISA, bioassays, and the like.

For affinity chromatography purification, any antibody which specifically binds
the peptides or peptide analogues may be used. For the production of antibodies, various host
animals, including but not limited to rabbits, mice, rats, etc., may be immunized by injection
with a peptide. The peptide may be attached to a suitable carrier, such as BSA or KLH, by
means of a side chain functional group or linkers attached to a side chain functional group.
Various adjuvants may be used to increase the immunological response, depending on the host
species, including but not limited to Freund's (complete and incomplete), mineral gels such as
aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols,
polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dinitrophenol, and potentially
useful human adjuvants such as BCG (bacilli Calmette-Guerin) and Corynebacterium parvum.

Monoclonal antibodies to a peptide may be prepared using any technique which
provides for the production of antibody molecules by continuous cell lines in culture. These
include but are not limited to the hybridoma technique originally described by Koehler and
Milstein, 1975, Nature 256:495-497, the human B-cell hybridoma technique (Kosbor et al.,
1983, Immunology Today 4:72; Cote et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030),
and the EBV-hybridoma technique (Cole et al., 1985, Monoclonal Antibodies and Cancer
Therapy, Alan R. Liss, Inc., pp. 77-96). In addition, techniques developed for the
production of "chimeric antibodies" (Morrison et al., 1984, Proc. Natl. Acad. Sci. U.S.A.
81:6851-6855; Neuberger et al., 1984, Nature 312:604-608; Takeda et al., 1985, Nature
314:452-454) by splicing the genes from a mouse antibody molecule of appropriate antigen
specificity together with genes from a human antibody molecule of appropriate biological
activity can be used. Alternatively, techniques described for the production of single chain
antibodies (U.S. Patent No. 4,946,778) can be adapted to produce peptide-specific single chain
antibodies.

Antibody fragments which contain deletions of specific binding sites may be
generated by known techniques. For example, such fragments include but are not limited to
F(ab')2 fragments, which can be produced by pepsin digestion of the antibody molecule, and Fab
fragments, which can be generated by reducing the disulfide bridges of the F(ab')2 fragments.
Alternatively, Fab expression libraries may be constructed (Huse et al., 1989, Science
246:1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with the
desired specificity for the peptide of interest.

The antibody or antibody fragment specific for the desired peptide can be
attached, for example, to agarose, and the antibody-agarose complex is used in
immunochromatography to purify peptides of the invention. See Scopes, 1984, Protein
Purification: Principles and Practice, Springer-Verlag New York, Inc., NY; Livingstone, 1974,
Methods in Enzymology: Immunoaffinity Chromatography of Proteins 34:723-731.

XIII. Uses of PDZ Domain Binding and Antagonist Compounds 

As indicated in the Background section, PDZ domain-containing proteins are
involved in a number of biological functions, including, but not limited to, vesicular trafficking,
tumor suppression, protein sorting, establishment of membrane polarity, apoptosis, regulation
of immune response and organization of synapse formation. In general, this family of proteins
has a common function of facilitating the assembly of multi-protein complexes, often serving
as a bridge between several proteins, or regulating the function of other proteins. Additionally,
as also noted supra, these proteins are found in essentially all cell types.

Consequently, modulation of these interactions can be utilized to control a wide
variety of biological and physiological conditions. In particular, modulation of
interactions such as those disclosed herein can be utilized to control movement of vesicles
within a cell and to inhibit tumor formation, as well as in the treatment of immune disorders,
neurological disorders, muscular disorders, and intestinal disorders.

Certain compounds which modulate binding of the PDZ proteins and PL
proteins can be used to inhibit leukocyte activation, which is manifested in measurable events
including, but not limited to, cytokine production, cell adhesion, expansion of cell numbers,
apoptosis and cytotoxicity. Thus, some compounds of the invention can be used to treat diverse
conditions associated with undesirable leukocyte activation, including, but not limited to, acute
and chronic inflammation, graft-versus-host disease, transplantation rejection, hypersensitivities
and autoimmunity such as multiple sclerosis, rheumatoid arthritis, periodontal disease, systemic
lupus erythematosus, juvenile diabetes mellitus, non-insulin-dependent diabetes, and allergies,
and other conditions listed herein (see, e.g., Section 6.4, supra).

Thus, the invention also relates to methods of using such compositions in
modulating leukocyte activation as measured by, for example, cytotoxicity, cytokine
production, cell proliferation, and apoptosis.
XIV. Formulation and Route of Administration

A. Introduction of Agonists or Antagonists (e.g., Peptides and Fusion Proteins) into Cells

In one aspect, the PDZ-PL antagonists of the invention are introduced into a cell
to modulate (i.e., increase or decrease) a biological function or activity of the cell. Many small
organic molecules readily cross the cell membranes (or can be modified by one of skill using
routine methods to increase the ability of compounds to enter cells, e.g., by reducing or
eliminating charge, increasing lipophilicity, or conjugating the molecule to a moiety targeting a
cell surface receptor such that the molecule is internalized after interacting with the receptor).
Methods for introducing larger molecules, e.g., peptides and fusion proteins, are also well
known, including, e.g., injection, liposome-mediated fusion, application of a hydrogel,
conjugation to a targeting moiety conjugate endocytosed by the cell, electroporation, and the like.

In one embodiment, the antagonist or agent is a fusion polypeptide or
derivatized polypeptide. A fusion or derivatized protein may include a targeting moiety that
increases the ability of the polypeptide to traverse a cell membrane or causes the polypeptide to
be delivered preferentially to a specified cell type (e.g., liver cells or tumor cells) or cell
compartment (e.g., nuclear compartment). Examples of targeting moieties
include lipid tails and amino acid sequences such as antennapedia peptide or a nuclear localization
signal (NLS; e.g., Xenopus nucleoplasmin; Robbins et al., 1991, Cell 64:615).

In one embodiment of the invention, a peptide sequence or peptide analog
determined to inhibit PDZ domain-PL protein binding in an assay of the invention is
introduced into a cell by linking the sequence to an amino acid sequence that facilitates its
transport through the plasma membrane (a "transmembrane transporter sequence"). The
peptides of the invention may be used directly or fused to a transmembrane transporter
sequence to facilitate their entry into cells. In the case of such a fusion peptide, each peptide
may be fused with a heterologous peptide at its amino terminus directly or by using a flexible
polylinker such as the pentamer G-G-G-G-S (SEQ ID NO:1) repeated 1 to 3 times. Such a linker
has been used in constructing single chain antibodies (scFv) by being inserted between VH and
VL (Bird et al., 1988, Science 242:423-426; Huston et al., 1988, Proc. Natl. Acad. Sci. U.S.A.
85:5879-5883). The linker is designed to enable the correct interaction between the two
beta-sheets forming the variable region of the single chain antibody. Other linkers which may be
used include Glu-Gly-Lys-Ser-Ser-Gly-Ser-Gly-Ser-Glu-Ser-Lys-Val-Asp (SEQ ID NO:2)
(Chaudhary et al., 1990, Proc. Natl. Acad. Sci. U.S.A. 87:1066-1070) and
Lys-Glu-Ser-Gly-Ser-Val-Ser-Ser-Glu-Gln-Leu-Ala-Gln-Phe-Arg-Ser-Leu-Asp (SEQ ID NO:3)
(Bird et al., 1988, Science 242:423-426).

A number of peptide sequences have been described in the art as capable of
facilitating the entry of a peptide linked to these sequences into a cell through the plasma
membrane (Derossi et al., 1998, Trends in Cell Biol. 8:84). For the purpose of this invention,
such peptides are collectively referred to as transmembrane transporter peptides. Examples of
these peptides include, but are not limited to, tat derived from HIV (Vives et al., 1997, J. Biol.
Chem. 272:16010; Nagahara et al., 1998, Nat. Med. 4:1449), antennapedia from Drosophila
(Derossi et al., 1994, J. Biol. Chem. 269:10444), VP22 from herpes simplex virus (Elliot and
O'Hare, 1997, Cell 88:223-233), complementarity-determining regions (CDR) 2 and 3 of
anti-DNA antibodies (Avrameas et al., 1998, Proc. Natl. Acad. Sci. U.S.A. 95:5601-5606), 70 kDa
heat shock protein (Fujihara, 1999, EMBO J. 18:411-419) and transportan (Pooga et al., 1998,
FASEB J. 12:67-77). In a preferred embodiment of the invention, a truncated HIV tat peptide
having the sequence of GYGRKKRRQRRRG (SEQ ID NO:4) is used.

It is preferred that a transmembrane transporter sequence is fused to a
hematopoietic cell surface receptor carboxyl terminal sequence at its amino-terminus with or
without a linker. Generally, the C-terminus of a PDZ motif sequence (PL sequence) must be
free in order to interact with a PDZ domain. The transmembrane transporter sequence may be
used in whole or in part as long as it is capable of facilitating entry of the peptide into a cell.
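
The fusion layout described above (transporter sequence at the amino terminus, an optional G-G-G-G-S linker, and the PL sequence at the free carboxyl terminus) can be illustrated with a minimal Python sketch. The function name `build_fusion` and the example PL sequence are hypothetical and are not taken from the specification; only the tat sequence (SEQ ID NO:4) and the linker unit (SEQ ID NO:1) come from the text.

```python
# Illustrative sketch only: helper name and example PL sequence are hypothetical.
TAT = "GYGRKKRRQRRRG"   # truncated HIV tat peptide (SEQ ID NO:4 in the text)
LINKER_UNIT = "GGGGS"   # G-G-G-G-S pentamer (SEQ ID NO:1 in the text)


def build_fusion(pl_cterm, linker_repeats=1, transporter=TAT):
    """Return transporter + linker + PL sequence, written N- to C-terminus.

    linker_repeats may be 0 (direct fusion, no linker) or 1 to 3 copies
    of the pentamer, following the range given in the text.
    """
    if not 0 <= linker_repeats <= 3:
        raise ValueError("linker_repeats must be between 0 and 3")
    if not pl_cterm:
        raise ValueError("pl_cterm must not be empty")
    fusion = transporter + LINKER_UNIT * linker_repeats + pl_cterm
    # The PL sequence has to stay at the free C-terminus of the fusion.
    assert fusion.endswith(pl_cterm)
    return fusion


if __name__ == "__main__":
    example_pl = "AAAESDV"  # hypothetical placeholder, not a disclosed sequence
    print(build_fusion(example_pl, linker_repeats=2))
```

Placing the PL sequence last in the concatenation reflects the requirement that its C-terminus remain free; the transporter and linker are therefore only ever added on the amino-terminal side.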

In an alternate embodiment of the invention, a hematopoietic cell surface
receptor C-terminal sequence may be used alone when it is delivered in a manner that allows its
entry into cells in the absence of a transmembrane transporter sequence. For example, the
peptide may be delivered in a liposome formulation or using a gene therapy approach by
delivering a coding sequence for the PDZ motif alone or as a fusion molecule into a target cell.

The compounds of the invention may also be administered via liposomes,
which serve to target the conjugates to a particular tissue, such as lymphoid tissue, or to target
them selectively to infected cells, as well as to increase the half-life of the peptide composition.
Liposomes include emulsions, foams, micelles, insoluble monolayers, liquid crystals,
phospholipid dispersions, lamellar layers and the like. In these preparations the peptide to be
delivered is incorporated as part of a liposome, alone or in conjunction with a molecule which
binds to, e.g., a receptor prevalent among lymphoid cells, such as monoclonal antibodies which
bind to the CD45 antigen, or with other therapeutic or immunogenic compositions. Thus,
liposomes filled with a desired peptide or conjugate of the invention can be directed to the site
of lymphoid cells, where the liposomes then deliver the selected inhibitor compositions.
Liposomes for use in the invention are formed from standard vesicle-forming lipids, which
generally include neutral and negatively charged phospholipids and a sterol, such as
cholesterol. The selection of lipids is generally guided by consideration of, e.g., liposome size,
acid lability and stability of the liposomes in the blood stream. A variety of methods are
available for preparing liposomes, as described in, e.g., Szoka et al., Ann. Rev. Biophys.
Bioeng. 9:467 (1980), U.S. Pat. Nos. 4,235,871, 4,501,728 and 4,837,028.

The targeting of liposomes using a variety of targeting agents is well known in
the art (see, e.g., U.S. Patent Nos. 4,957,773 and 4,603,044). For targeting to the immune cells,
a ligand to be incorporated into the liposome can include, e.g., antibodies or fragments thereof
specific for cell surface determinants of the desired immune system cells. A liposome
suspension containing a peptide or conjugate may be administered intravenously, locally,
topically, etc. in a dose which varies according to, inter alia, the manner of administration, the
conjugate being delivered, and the stage of the disease being treated.

In order to specifically deliver a PDZ motif sequence (PL sequence) peptide into
a specific cell type, the peptide may be linked to a cell-specific targeting moiety. Such moieties
include, but are not limited to, ligands for diverse leukocyte surface molecules such as growth factors,
hormones and cytokines, as well as antibodies or antigen-binding fragments thereof. Since a
large number of cell surface receptors have been identified in leukocytes, ligands or antibodies
specific for these receptors may be used as cell-specific targeting moieties. For example,
interleukin-2, B7-1 (CD80), B7-2 (CD86) and CD40 or peptide fragments thereof may be used
to specifically target activated T cells (The Leucocyte Antigen Facts Book, 1997, Barclay et al.
(eds.), Academic Press). CD28, CTLA-4 and CD40L or peptide fragments thereof may be used
to specifically target B cells. Furthermore, Fc domains may be used to target certain Fc
receptor-expressing cells such as monocytes.

Antibodies are the most versatile cell-specific targeting moieties because they
can be generated against any cell surface antigen. Monoclonal antibodies have been generated
against leukocyte lineage-specific markers such as certain CD antigens. Antibody variable
region genes can be readily isolated from hybridoma cells by methods well known in the art.
However, since antibodies are assembled between two heavy chains and two light chains, it is
preferred that a scFv be used as a cell-specific targeting moiety in the present invention. Such
scFv are comprised of VH and VL domains linked into a single polypeptide chain by a flexible
linker peptide.

The PDZ motif sequence (PL sequence) may be linked to a transmembrane
transporter sequence and a cell-specific targeting moiety to produce a tri-fusion molecule. This
molecule can bind to a leukocyte surface molecule, pass through the membrane and target
PDZ domains. Alternatively, a PDZ motif sequence (PL sequence) may be linked to a
cell-specific targeting moiety that binds to a surface molecule that internalizes the fusion peptide.

In another approach, microspheres of artificial polymers of mixed amino acids
(proteinoids) have been used to deliver pharmaceuticals. For example, U.S. Pat. No. 4,925,673
describes drug-containing proteinoid microsphere carriers as well as methods for their
preparation and use. These proteinoid microspheres are useful for the delivery of a number of
active agents. See also U.S. Patent Nos. 5,907,030 and 6,033,884, which are incorporated
herein by reference.


B. Introduction of Polynucleotides into Cells 

By introducing gene sequences into cells, gene therapy can be used to treat
conditions in which leukocytes are activated to result in deleterious consequences. In one
embodiment, a polynucleotide that encodes a PL sequence peptide of the invention is
introduced into a cell where it is expressed. The expressed peptide then inhibits the interaction
of PDZ proteins and PL proteins in the cell.

Thus, in one embodiment, the polypeptides of the invention are expressed in a
cell by introducing a nucleic acid (e.g., a DNA expression vector or mRNA) encoding the
desired protein or peptide into the cell. Expression may be either constitutive or inducible
depending on the vector and choice of promoter. Methods for introduction and expression of
nucleic acids into a cell are well known in the art and described herein.

In a specific embodiment, nucleic acids comprising a sequence encoding a
peptide disclosed herein are administered to a human subject. In this embodiment of the
invention, the nucleic acid produces its encoded product that mediates a therapeutic effect. Any
of the methods for gene therapy available in the art can be used according to the present
invention. Exemplary methods are described below.

For general reviews of the methods of gene therapy, see Goldspiel et al., 1993,
Clinical Pharmacy 12:488-505; Wu and Wu, 1991, Biotherapy 3:87-95; Tolstoshev, 1993, Ann.
Rev. Pharmacol. Toxicol. 32:573-596; Mulligan, 1993, Science 260:926-932; Morgan and
Anderson, 1993, Ann. Rev. Biochem. 62:191-217; and May, 1993, TIBTECH 11(5):155-215.
Methods commonly known in the art of recombinant DNA technology which can be used are
described in Ausubel et al. (eds.), 1993, Current Protocols in Molecular Biology, John Wiley &
Sons, NY; and Kriegler, 1990, Gene Transfer and Expression, A Laboratory Manual, Stockton
Press, NY.

In a preferred embodiment of the invention, the therapeutic composition
comprises a coding sequence that is part of an expression vector. In particular, such a nucleic
acid has a promoter operably linked to the coding sequence, said promoter being inducible or
constitutive, and, optionally, tissue-specific. In another specific embodiment, a nucleic acid
molecule is used in which the coding sequence and any other desired sequences are flanked by
regions that promote homologous recombination at a desired site in the genome, thus providing
for intrachromosomal expression of the nucleic acid (Koller and Smithies, 1989, Proc. Natl.
Acad. Sci. USA 86:8932-8935; Zijlstra et al., 1989, Nature 342:435-438).

Delivery of the nucleic acid into a patient may be either direct, in which case the
patient is directly exposed to the nucleic acid or nucleic acid-carrying vector, or indirect, in
which case cells are first transformed with the nucleic acid in vitro, then transplanted into the
patient. These two approaches are known, respectively, as in vivo and ex vivo gene therapy.

In a specific embodiment, the nucleic acid is directly administered in vivo, where
it is expressed to produce the encoded product. This can be accomplished by any methods
known in the art, e.g., by constructing it as part of an appropriate nucleic acid expression vector
and administering it so that it becomes intracellular, e.g., by infection using a defective or
attenuated retroviral or other viral vector (see U.S. Patent No. 4,980,286), by direct injection of
naked DNA, by use of microparticle bombardment (e.g., a gene gun; Biolistic, Dupont), by
coating with lipids or cell-surface receptors or transfecting agents, by encapsulation in
liposomes, microparticles, or microcapsules, by administering it in linkage to a peptide which is
known to enter the nucleus, or by administering it in linkage to a ligand subject to
receptor-mediated endocytosis (see, e.g., Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432), which can
be used to target cell types specifically expressing the receptors. In another embodiment, a
nucleic acid-ligand complex can be formed in which the ligand comprises a fusogenic viral
peptide to disrupt endosomes, allowing the nucleic acid to avoid lysosomal degradation. In yet
another embodiment, the nucleic acid can be targeted in vivo for cell specific uptake and
expression, by targeting a specific receptor (see, e.g., PCT Publications WO 92/06180 dated
April 16, 1992; WO 92/22635 dated December 23, 1992; WO 92/20316 dated November 26,
1992; WO 93/14188 dated July 22, 1993; WO 93/20221 dated October 14, 1993).
Alternatively, the nucleic acid can be introduced intracellularly and incorporated within host
cell DNA for expression, by homologous recombination (Koller and Smithies, 1989, Proc. Natl.
Acad. Sci. USA 86:8932-8935; Zijlstra et al., 1989, Nature 342:435-438).

In a preferred embodiment of the invention, adenoviruses as viral vectors can be
used in gene therapy. Adenoviruses have the advantage of being capable of infecting
non-dividing cells (Kozarsky and Wilson, 1993, Current Opinion in Genetics and Development
3:499-503). Other instances of the use of adenoviruses in gene therapy can be found in
Rosenfeld et al., 1991, Science 252:431-434; Rosenfeld et al., 1992, Cell 68:143-155; and
Mastrangeli et al., 1993, J. Clin. Invest. 91:225-234. Furthermore, adenoviral vectors with
modified tropism may be used for cell specific targeting (WO 98/40508). Adeno-associated
virus (AAV) has also been proposed for use in gene therapy (Walsh et al., 1993, Proc. Soc.
Exp. Biol. Med. 204:289-300).

In addition, retroviral vectors (see Miller et al., 1993, Meth. Enzymol. 217:581-599)
have been modified to delete retroviral sequences that are not necessary for packaging of
the viral genome and integration into host cell DNA. The coding sequence to be used in gene
therapy is cloned into the vector, which facilitates delivery of the gene into a patient. More
detail about retroviral vectors can be found in Boesen et al., 1994, Biotherapy 6:291-302, which
describes the use of a retroviral vector to deliver the mdr1 gene to hematopoietic stem cells in
order to make the stem cells more resistant to chemotherapy. Other references illustrating the
use of retroviral vectors in gene therapy are: Clowes et al., 1994, J. Clin. Invest. 93:644-651;
Kiem et al., 1994, Blood 83:1467-1473; Salmons and Gunzberg, 1993, Human Gene Therapy
4:129-141; and Grossman and Wilson, 1993, Curr. Opin. in Genetics and Devel. 3:110-114.

Another approach to gene therapy involves transferring a gene to cells in tissue
culture. Usually, the method of transfer includes the transfer of a selectable marker to the cells.
The cells are then placed under selection to isolate those cells that have taken up and are
expressing the transferred gene. Those cells are then delivered to a patient.

In this embodiment, the nucleic acid is introduced into a cell prior to
administration in vivo of the resulting recombinant cell. Such introduction can be carried out
by any method known in the art, including but not limited to transfection, electroporation,
lipofection, microinjection, infection with a viral or bacteriophage vector containing the nucleic
acid sequences, cell fusion, chromosome-mediated gene transfer, microcell-mediated gene
transfer, spheroplast fusion, etc. Numerous techniques are known in the art for the introduction
of foreign genes into cells (see, e.g., Loeffler and Behr, 1993, Meth. Enzymol. 217:599-618;
Cohen et al., 1993, Meth. Enzymol. 217:618-644; Cline, 1985, Pharmac. Ther. 29:69-92) and
may be used in accordance with the present invention, provided that the necessary
developmental and physiological functions of the recipient cells are not disrupted. The
technique should provide for the stable transfer of the nucleic acid to the cell, so that the nucleic
acid is expressible by the cell and preferably heritable and expressible by its cell progeny. In a
preferred embodiment, the cell used for gene therapy is autologous to the patient.

In a specific embodiment, the nucleic acid to be introduced for purposes of gene
therapy comprises an inducible promoter operably linked to the coding sequence, such that
expression of the nucleic acid is controllable by controlling the presence or absence of the
appropriate inducer of transcription.

Oligonucleotides such as anti-sense RNA and DNA molecules, and ribozymes
that function to inhibit the translation of a targeted mRNA, especially its C-terminus, are also
within the scope of the invention. Anti-sense RNA and DNA molecules act to directly block
the translation of mRNA by binding to targeted mRNA and preventing protein translation. In
regard to antisense DNA, oligodeoxyribonucleotides derived from the translation initiation site,
e.g., between -10 and +10 regions of a nucleotide sequence, are preferred.
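
As a rough illustration of deriving an antisense oligodeoxyribonucleotide from the region around the translation initiation site, the following Python sketch takes the 20 nucleotides spanning positions -10 to +10 and returns their reverse complement. The function name, the example sequence, and the numbering convention (the A of the ATG start codon is +1 and there is no position 0) are assumptions made for illustration; the text does not specify them.

```python
# Illustrative sketch only: names, example sequence and numbering convention
# are assumptions, not part of the specification.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_around_start(cdna, start_index, flank=10):
    """Return the antisense DNA oligo covering -flank..+flank of the start codon.

    cdna        : sense-strand sequence written as DNA (A, C, G, T)
    start_index : 0-based index of the A of the ATG start codon (position +1)
    """
    seq = cdna.upper()
    if seq[start_index:start_index + 3] != "ATG":
        raise ValueError("start_index does not point at an ATG codon")
    if start_index < flank or start_index + flank > len(seq):
        raise ValueError("sequence too short for the requested flank")
    # Positions -flank..-1 followed by +1..+flank (2 * flank nucleotides).
    target = seq[start_index - flank:start_index + flank]
    # Reverse complement so the oligo can base-pair with the sense strand.
    return target.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    example = "GCCGCCACCGCCATGGCGTCCAAG"  # hypothetical sequence
    print(antisense_around_start(example, example.index("ATG")))
```

The oligo is returned 5' to 3'; whether a given candidate is actually effective would still need to be determined experimentally.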

The antisense oligonucleotide may comprise at least one modified base moiety
which is selected from the group including, but not limited to, 5-fluorouracil, 5-bromouracil,
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine,
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine,
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine,
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine,
2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil,
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil,
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine.

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific
cleavage of RNA. The mechanism of ribozyme action involves sequence specific hybridization
of the ribozyme molecule to complementary target RNA, followed by endonucleolytic
cleavage. Within the scope of the invention are engineered hammerhead motif ribozyme molecules
that specifically and efficiently catalyze endonucleolytic cleavage of target RNA sequences.

Specific ribozyme cleavage sites within any potential RNA target are initially
identified by scanning the target molecule for ribozyme cleavage sites, which include the
following sequences: GUA, GUU and GUC. Once identified, short RNA sequences of between
15 and 20 ribonucleotides corresponding to the region of the target gene containing the
cleavage site may be evaluated for predicted structural features, such as secondary structure, that
may render the oligonucleotide sequence unsuitable. The suitability of candidate targets may
also be evaluated by testing their accessibility to hybridization with complementary
oligonucleotides, using ribonuclease protection assays.
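
The initial scanning step described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the window is centred on the triplet where the sequence ends allow (the text does not say how the 15 to 20 nucleotide region is positioned), and no secondary-structure prediction or accessibility testing is attempted.

```python
# Illustrative sketch only: function names and window placement are assumptions.
import re


def to_rna(seq):
    """Upper-case the sequence and write it as RNA (T -> U)."""
    return seq.upper().replace("T", "U")


def find_cleavage_sites(seq):
    """Return (position, triplet) for every GUA, GUU or GUC in the target."""
    rna = to_rna(seq)
    # A lookahead is used so that every occurrence is reported.
    return [(m.start(), m.group(1)) for m in re.finditer(r"(?=(GU[ACU]))", rna)]


def candidate_windows(seq, window=15):
    """Return (position, triplet, region) with a region of `window` nucleotides
    (15 to 20) containing each cleavage site."""
    if not 15 <= window <= 20:
        raise ValueError("window must be between 15 and 20 nucleotides")
    rna = to_rna(seq)
    if len(rna) < window:
        return []
    results = []
    for pos, triplet in find_cleavage_sites(rna):
        # Centre the window on the triplet, clamped to the sequence ends.
        start = max(0, min(pos - (window - 3) // 2, len(rna) - window))
        results.append((pos, triplet, rna[start:start + window]))
    return results


if __name__ == "__main__":
    target = "AAGUCGGGUAAA" + "C" * 18  # hypothetical target sequence
    for pos, triplet, region in candidate_windows(target):
        print(pos, triplet, region)
```

Each returned region would then be a candidate for the structural evaluation and ribonuclease protection testing mentioned in the text.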

The anti-sense RNA and DNA molecules and ribozymes of the invention may be
prepared by any method known in the art for the synthesis of nucleic acid molecules. These
include techniques for chemically synthesizing oligodeoxyribonucleotides well known in the
art, such as, for example, solid phase phosphoramidite chemical synthesis. Alternatively, RNA
molecules may be generated by in vitro and in vivo transcription of DNA sequences encoding
the RNA molecule. Such DNA sequences may be incorporated into a wide variety of vectors
which contain suitable RNA polymerase promoters such as the T7 or SP6 polymerase
promoters. Alternatively, antisense cDNA constructs that synthesize antisense RNA
constitutively or inducibly, depending on the promoter used, can be introduced stably into cell
lines.

Various modifications to the DNA molecules may be introduced as a means of
increasing intracellular stability and half-life. Possible modifications include, but are not
limited to, the addition of flanking sequences of ribo- or deoxy-nucleotides to the 5' and/or 3'
ends of the molecule or the use of phosphorothioate or 2' O-methyl rather than
phosphodiester linkages within the oligodeoxyribonucleotide backbone.

C. Other Pharmaceutical Compositions 

The compounds of the invention may be administered to a subject per se or in
the form of a sterile composition or a pharmaceutical composition. Pharmaceutical
compositions comprising the compounds of the invention may be manufactured by means of
conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying,
encapsulating, entrapping or lyophilizing processes. Pharmaceutical compositions may be
formulated in conventional manner using one or more physiologically acceptable carriers,
diluents, excipients or auxiliaries that facilitate processing of the active peptides or peptide
analogues into preparations which can be used pharmaceutically. Proper formulation is
dependent upon the route of administration chosen.

For topical administration, the compounds of the invention may be formulated as
solutions, gels, ointments, creams, suspensions, etc., as are well-known in the art.

Systemic formulations include those designed for administration by injection,
e.g., subcutaneous, intravenous, intramuscular, intrathecal or intraperitoneal injection, as well as
those designed for transdermal, transmucosal, oral or pulmonary administration.

For injection, the compounds of the invention may be formulated in aqueous
solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's
solution, or physiological saline buffer. The solution may contain formulatory agents such as
suspending, stabilizing and/or dispersing agents.

Alternatively, the compounds may be in powder form for constitution with a
suitable vehicle, e.g., sterile pyrogen-free water, before use.

For transmucosal administration, penetrants appropriate to the barrier to be
permeated are used in the formulation. Such penetrants are generally known in the art. This
route of administration may be used to deliver the compounds to the nasal cavity.

For oral administration, the compounds can be readily formulated by combining
the active peptides or peptide analogues with pharmaceutically acceptable carriers well known
in the art. Such carriers enable the compounds of the invention to be formulated as tablets,
pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral
ingestion by a patient to be treated. For oral solid formulations such as, for example, powders,
capsules and tablets, suitable excipients include fillers such as sugars, e.g., lactose, sucrose,
mannitol and sorbitol; cellulose preparations such as maize starch, wheat starch, rice starch,
potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose,
sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP); granulating agents; and
binding agents. If desired, disintegrating agents may be added, such as cross-linked
polyvinylpyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate.

If desired, solid dosage forms may be sugar-coated or enteric-coated using 
standard techniques. 

For oral liquid preparations such as, for example, suspensions, elixirs and 
solutions, suitable carriers, excipients or diluents include water, glycols, oils, alcohols, etc. 
Additionally, flavoring agents, preservatives, coloring agents and the like may be added.

For buccal administration, the compounds may take the form of tablets, 
lozenges, etc. formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present
invention are conveniently delivered in the form of an aerosol spray from pressurized packs or a
nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane,
trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the
case of a pressurized aerosol the dosage unit may be determined by providing a valve to deliver
a metered amount. Capsules and cartridges of e.g. gelatin for use in an inhaler or insufflator
may be formulated containing a powder mix of the compound and a suitable powder base such
as lactose or starch.

The compounds may also be formulated in rectal or vaginal compositions such 
as suppositories or retention enemas, e.g., containing conventional suppository bases such as 
cocoa butter or other glycerides.

In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

Alternatively, other pharmaceutical delivery systems may be employed. 
Liposomes and emulsions are well known examples of delivery vehicles that may be used to 
deliver peptides and peptide analogues of the invention. Certain organic solvents such as 
dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity.
Additionally, the compounds may be delivered using a sustained-release system, such as
semipermeable matrices of solid polymers containing the therapeutic agent. Various
sustained-release materials have been established and are well known by those skilled in the art.
Sustained-release capsules may, depending on their chemical nature, release the compounds
for a few weeks up to over 100 days. Depending on the chemical nature and the biological
stability of the therapeutic reagent, additional strategies for protein stabilization may be 
employed. 

As the compounds of the invention may contain charged side chains or termini, 
they may be included in any of the above-described formulations as the free acids or bases or as 
pharmaceutically acceptable salts. Pharmaceutically acceptable salts are those salts which
substantially retain the biologic activity of the free bases and which are prepared by reaction
with inorganic acids. Pharmaceutical salts tend to be more soluble in aqueous and other protic
solvents than are the corresponding free base forms. 

D. Effective Dosages 
The compounds of the invention will generally be used in an amount effective to
achieve the intended purpose. The compounds of the invention, or pharmaceutical compositions
thereof, are administered or applied in a therapeutically effective amount. By therapeutically
effective amount is meant an amount effective to ameliorate or prevent the symptoms, or prolong
the survival of, the patient being treated. Determination of a therapeutically effective amount is
well within the capabilities of those skilled in the art, especially in light of the detailed
disclosure provided herein. An "inhibitory amount" or "inhibitory concentration" of a PL-PDZ
binding inhibitor is an amount that reduces binding by at least about 40%, preferably at least
about 50%, often at least about 70%, and even as much as at least about 90%. Binding can be
measured in vitro (e.g., in an A assay or G assay) or in situ.
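
The percent reduction in binding behind an "inhibitory amount" can be illustrated with a short calculation. The following Python sketch is not part of the disclosed method; the function names and the example readings are hypothetical, and it assumes binding is reported as a single signal value (for example an OD reading) with and without inhibitor.

```python
# Thresholds named in the text for an "inhibitory amount"
# (percent reduction in binding).
INHIBITION_TIERS = (40.0, 50.0, 70.0, 90.0)


def percent_inhibition(signal_with_inhibitor, signal_without_inhibitor):
    """Percent reduction in binding signal relative to the uninhibited control."""
    if signal_without_inhibitor <= 0:
        raise ValueError("control signal must be positive")
    difference = signal_without_inhibitor - signal_with_inhibitor
    return 100.0 * difference / signal_without_inhibitor


def highest_tier_met(inhibition_pct):
    """Return the largest threshold reached, or None if below 40%."""
    met = [tier for tier in INHIBITION_TIERS if inhibition_pct >= tier]
    return max(met) if met else None


# Hypothetical readings: 1.0 without inhibitor, 0.25 with inhibitor.
reduction = percent_inhibition(0.25, 1.0)
print(reduction, highest_tier_met(reduction))
```

With these made-up readings the reduction is 75%, which meets the "at least about 70%" level but not the 90% level.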

For systemic administration, a therapeutically effective dose can be estimated 
initially from in vitro assays. For example, a dose can be formulated in animal models to 
achieve a circulating concentration range that includes the IC50 as determined in cell culture.
Such information can be used to more accurately determine useful doses in humans.

Initial dosages can also be estimated from in vivo data, e.g., animal models,
using techniques that are well known in the art. One having ordinary skill in the art could
readily optimize administration to humans based on animal data. 

Dosage amount and interval may be adjusted individually to provide plasma 
levels of the compounds that are sufficient to maintain therapeutic effect. Usual patient 
dosages for administration by injection range from about 0.1 to 5 mg/kg/day, preferably from
about 0.5 to 1 mg/kg/day. Therapeutically effective serum levels may be achieved by 
administering multiple doses each day. 

In cases of local administration or selective uptake, the effective local 
concentration of the compounds may not be related to plasma concentration. One having skill 
in the art will be able to optimize therapeutically effective local dosages without undue
experimentation. 

The amount of compound administered will, of course, be dependent on the 
subject being treated, on the subject's weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 

The therapy may be repeated intermittently while symptoms are detectable or even
when they are not detectable. The therapy may be provided alone or in combination with other
drugs. In the case of conditions associated with leukocyte activation such as transplantation 
rejection and autoimmunity, the drugs that may be used in combination with the compounds of 
the invention include, but are not limited to, steroid and non-steroid anti-inflammatory agents.

E. Toxicity

Preferably, a therapeutically effective dose of the compounds described herein 
will provide therapeutic benefit without causing substantial toxicity. 

Toxicity of the compounds described herein can be determined by standard
pharmaceutical procedures in cell cultures or experimental animals, e.g., by determining the
LD50 (the dose lethal to 50% of the population) or the LD100 (the dose lethal to 100% of the
population). The dose ratio between toxic and therapeutic effect is the therapeutic index.
Compounds which exhibit high therapeutic indices are preferred. The data obtained from these
cell culture assays and animal studies can be used in formulating a dosage range that is not
toxic for use in humans. The dosage of the compounds described herein lies preferably within a
range of circulating concentrations that include the effective dose with little or no toxicity. The
dosage may vary within this range depending upon the dosage form employed and the route of
administration utilized. The exact formulation, route of administration and dosage can be
chosen by the individual physician in view of the patient's condition. (See, e.g., Fingl et al.,
1975, In: The Pharmacological Basis of Therapeutics, Ch. 1, p. 1).
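
As an illustration of the dose ratio described above, the therapeutic index can be written as a one-line calculation. This Python sketch makes an assumption the text does not state: it takes the LD50 as the toxic dose and a median effective dose (called ED50 here) as the therapeutic dose. The function name and the numbers are hypothetical.

```python
def therapeutic_index(toxic_dose, effective_dose):
    """Ratio of toxic dose to therapeutic dose (here assumed LD50 / ED50).

    Both doses must be in the same units, e.g. mg/kg.
    """
    if toxic_dose <= 0 or effective_dose <= 0:
        raise ValueError("doses must be positive")
    return toxic_dose / effective_dose


# Hypothetical values: LD50 of 100 mg/kg and ED50 of 5 mg/kg.
print(therapeutic_index(100.0, 5.0))
```

A larger ratio means a wider margin between the effective and the toxic dose, which is why compounds with high indices are preferred.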



EXAMPLE 1

GENERATION OF EUKARYOTIC EXPRESSION CONSTRUCTS BEARING DNA
FRAGMENTS THAT ENCODE PDZ DOMAIN CONTAINING GENES OR PORTIONS
OF PDZ DOMAIN GENES

This example describes the cloning of PDZ domain containing genes, or portions
of PDZ domain containing genes, into eukaryotic expression vectors in fusion with a
number of protein tags, including but not limited to Glutathione S-Transferase (GST),
Enhanced Green Fluorescent Protein (EGFP), or Hemagglutinin (HA).

A. Strategy

DNA fragments corresponding to PDZ domain containing genes were generated
by RT-PCR from RNA derived from a library of individual cell lines (CLONTECH Cat# K4000-1),
using random (oligo-nucleotide) primers (Invitrogen Cat# 48190011). DNA
fragments corresponding to PDZ domain containing genes or portions of PDZ domain
containing genes were generated by standard PCR, using the above purified cDNA fragments and
specific primers (see Table 5). The primers were designed to create restriction nuclease
recognition sites at the PCR fragment's ends, to allow cloning of those fragments into
appropriate expression vectors. Subsequent to PCR, DNA samples were subjected to agarose
gel electrophoresis. Bands corresponding to the expected size were excised. DNA was
extracted with the Sephaglas Band Prep Kit (Amersham Pharmacia Cat# 27-9285-01) and digested
with the appropriate restriction endonuclease. Digested DNA samples were purified once more by
gel electrophoresis, according to the same protocol used above. Purified DNA fragments were
coprecipitated and ligated with the appropriate linearized vector. After transformation into
E. coli, bacterial colonies were screened by colony PCR and restriction digest for the presence
and correct orientation of insert. Positive clones were inoculated in liquid culture for large
scale DNA purification. The insert and flanking vector sites from the purified plasmid DNA
were sequenced to ensure correct sequence of fragments and junctions between the vectors and
fusion proteins.
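
The primer design described above can be checked by scanning a primer for the recognition sequence it is meant to introduce. The Python sketch below is illustrative only: the helper name is hypothetical, the two primer sequences are taken from Table 5 (IDs 1928 and 1929), and the recognition sequences used are the standard ones for BamHI (GGATCC) and EcoRI (GAATTC).

```python
# Recognition sequences for the two enzymes named in Table 5.
RESTRICTION_SITES = {"BamHI": "GGATCC", "EcoRI": "GAATTC"}


def find_sites(primer):
    """Map each enzyme whose site occurs in the primer to its 0-based position."""
    sequence = "".join(primer.split()).upper()
    found = {}
    for enzyme, site in RESTRICTION_SITES.items():
        position = sequence.find(site)
        if position != -1:
            found[enzyme] = position
    return found


# Primers 1928 (forward) and 1929 (reverse) for DLG1, domain 2 of 3.
primers = {
    "1928": "AATGGGGATCCAGCTCATTAAAGG",
    "1929": "ATACATACTTGTGGAATTCGCCAC",
}

for primer_id, sequence in primers.items():
    print(primer_id, find_sites(sequence))
```

The same check can be run over the remaining Table 5 primers to confirm that each carries the site named in its description.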

B. Vectors: 

All PDZ domain-containing genes were cloned into the vector pGEX-3X
(Amersham Pharmacia #27-4803-01, GenBank Acc#U13852, GI#595717), containing a tac
promoter, GST, Factor Xa, β-lactamase, and lac repressor.

The amino acid sequence of the pGEX-3X coding region including GST, Factor 
Xa, and the multiple cloning site is listed below. Note that linker sequences between the cloned 
inserts and GST-Factor Xa vary depending on the restriction endonuclease used for cloning. 
Amino acids in the translated region below that may change depending on the insertion used 
are indicated in small caps, and are included as changed in the construct sequence listed in (C). 
aa 1 - aa 232: 

MSPILGYWKIKGLVQPTRLLLEYLE 
NLPYYTDGDVKLTQSMAIIRY^ 

RIAYSKDFETLKVDFLSKLPEMLKMFEDRLCHKTYLNGDHVTHPDFMLYDAL 
DVVLYMDPMCLDAFPKXVCFKKIUEAIPQIDKYLKSSKYIAWPLQGWQATFGG 
GDHPPKSDLIEGRgipgnss (SEQ ID NO:5) 

In addition, TAX Interacting Protein 1 (TIP1), in whole or part, was cloned into
many other expression vectors, including but not limited to CD5γ, PEAK10 (both provided by
the laboratory of Dr. Brian Seed at Harvard University and generated by recombinant DNA
technology, containing an IgG region), and MIN (a derivative of MSCV, containing IRES and
NGFR, generated by recombinant DNA technology).

C. Constructs: 

Primers used to generate DNA fragments by PCR are listed in Table 5. PCR 
primer combinations and restriction sites for insert and vector are listed below, along with 
amino acid translation for insert and restriction sites. Non-native amino acid sequences are 
shown in lower case.

TABLE 5. Primers used in cloning of DLG1 (domain 2 of 3), MAGI1 (domain 2 of 6), and TIP1 into representative expression vectors.

ID# (Primer Name) | Primer Sequence | Description
1928 (654DL1 2F) | AATGGGGATCCAGCTCATTAAAGG (SEQ ID NO:6) | Forward (5' to 3') primer corresponding to DLG1, domain 2 of 3. Generates a BamHI site upstream (5') of the PDZ boundary. Used for cloning into pGEX-3X.
1929 (655DL1 2R) | ATACATACTTGTGGAATTCGCCAC (SEQ ID NO:7) | Reverse (3' to 5') primer corresponding to DLG1, domain 2 of 3. Generates an EcoRI site downstream (3') of the PDZ boundary. Used for cloning into pGEX-3X.
1453 (435BAF) | CACGGATCCCTTCTGAGTTGAAAGGC (SEQ ID NO:8) | Forward (5' to 3') primer corresponding to MAGI1, domain 2 of 6. Generates a BamHI site upstream (5') of the PDZ boundary. Used for cloning into pGEX-3X.
1454 (436BAR) | TATGAATTCCATCTGGATCAAAAGGCAATG (SEQ ID NO:9) | Reverse (3' to 5') primer corresponding to MAGI1, domain 2 of 6. Generates an EcoRI site downstream (3') of the PDZ boundary. Used for cloning into pGEX-3X.
399 (86TAF) | CAGGGATCCAAAGAGTTGAAATTCACAAGC (SEQ ID NO:10) | Forward (5' to 3') primer corresponding to TIP1. Generates a BamHI site upstream (5') of the PDZ boundary. Used for cloning into pGEX-3X.
400 (87TAR) | ACGGAATTCTGCAGCGACTGCCGCGTC (SEQ ID NO:11) | Reverse (3' to 5') primer corresponding to TIP1. Generates an EcoRI site downstream (3') of the PDZ boundary. Used for cloning into pGEX-3X.
1319 (TIP G5-1) | AGGATCCAGATGTCCTACATCCC (SEQ ID NO:12) | Forward (5' to 3') primer corresponding to TIP1. Generates a BamHI site upstream (5') of the start codon. Used for cloning into pGEX-3X.
1320 (TIP G3-1) | GGAATTCATGGACTGCTGCACGG (SEQ ID NO:13) | Reverse (3' to 5') primer corresponding to TIP1. Generates an EcoRI site downstream (3') of the stop codon. Used for cloning into pGEX-3X.
2753 (1109TIF) | AGAGAATTCTCGAGATGTCCTACATCCC (SEQ ID NO:14) | Forward (5' to 3') primer corresponding to TIP1. Generates an EcoRI site upstream (5') of the start codon. Used for cloning into MIN.
2762 (1117TIR) | TGGGAATTCCTAGGACAGCATGGACTG (SEQ ID NO:15) | Reverse (3' to 5') primer corresponding to TIP1. Generates an EcoRI site downstream (3') of the stop codon. Used for cloning into MIN.
2584 (1080TIF) | CTAGGATCCGGGCCAGCCGGTCACC (SEQ ID NO:16) | Forward (5' to 3') primer corresponding to TIP1. Generates a BamHI site upstream (5') of the PDZ boundary. Used for cloning into PEAK10 or CD5γ.
2585 (1081TIR) | GACGGATCCCCCTGCTGCACGGCCTTCTG (SEQ ID NO:17) | Reverse (3' to 5') primer corresponding to TIP1. Generates a BamHI site downstream (3') of the PDZ boundary. Used for cloning into PEAK10 or CD5γ.
2586 (1082TIR) | GACGAATTCCCCTGCTGCACGGCCTTCTG (SEQ ID NO:18) | Reverse (3' to 5') primer corresponding to TIP1. Generates an EcoRI site downstream (3') of the PDZ boundary. Used for cloning into PEAK10 or CD5γ.
2587 (1083TIF) | CTAGAATTCGGGCCAGCCGGTCACC (SEQ ID NO:19) | Forward (5' to 3') primer corresponding to TIP1. Generates an EcoRI site upstream (5') of the PDZ boundary. Used for cloning into PEAK10 or CD5γ.

1. DLG1, PDZ domain 2 of 3:

Acc#: U13897
GI#: 558437

•Construct: DLG1, PDZ domain 2 of 3-pGEX-3X
Primers: 1928 & 1929

Vector Cloning Sites (5'/3'): BamHI/EcoRI
Insert Cloning Sites (5'/3'): BamHI/EcoRI

aa 1 - aa 88

giqLIKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAAHKDGKLQIGDKLLAV
NNVCLEEVTHEEAVTALKNTSDFVYLKVAnss (SEQ ID NO:20)

2. MAGI1, PDZ domain 2 of 6:

Acc#: AB010894
GI#: 3370997

•Construct: MAGI1, PDZ domain 2 of 6-pGEX-3X
Primers: 1453 & 1454

Vector Cloning Sites (5'/3'): BamHI/EcoRI
Insert Cloning Sites (5'/3'): BamHI/EcoRI

aa 1 - aa 108

giPSELKGKFIHTKLRKSSRGFGFTWGGDEPDEFLQIKSLVLD 
GPAALDGKMETGDVWSVNDTCVLGHTHAQVVKIFQSIPIGA 
SVDLELCRGYPLPFDPDgihrd (SEQ ID NO:21) 

3. TAX Interacting Protein 1 (TIP1):

Acc#: AF028823.2
GI#: 11908159

•Construct: TIP1, PDZ domain 1 of 1-pGEX-3X
Primers: 399 & 400

Vector Cloning Sites (5'/3'): BamHI/EcoRI
Insert Cloning Sites (5'/3'): BamHI/EcoRI

aa 1 - aa 107

giQRVEIHKLRQGENLILGFSIGGGIDQDPSQNPFSEDKTDKGI
YVTRVSEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQARK
RLTKRSEEVVRLLVTRQSLQnss (SEQ ID NO:22)

•Construct: TIP1-pGEX-3X

Primers: 1319 & 1320

Vector Cloning Sites (5'/3'): BamHI/EcoRI
Insert Cloning Sites (5'/3'): BamHI/EcoRI

aa 1 - aa 128

giqMSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIGGGIDQDPSQNPF
SEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQ
ARKRLTKRSEEVVRLLVTRQSLQKAVQQSMnss (SEQ ID NO:23)

•Construct: TIP1-MIN

Primers: 2753 & 2762

Vector Cloning Sites (5'/3'): EcoRI/EcoRI
Insert Cloning Sites (5'/3'): EcoRI/EcoRI

aa 1 - aa 129

agilEMSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIGGGIDQD
PSQNPFSEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNG
WDMTMVTHDQARKRLTKRSEEVVRLLVTRQSLQKAVQQS
MLS (SEQ ID NO:24)

•Construct: TIP1-CD5γ
Primers: 2584 & 2585

Vector Cloning Sites (5'/3'): BamHI/BamHI
Insert Cloning Sites (5'/3'): BamHI/BamHI

aa 1 - aa 122

adPGQPVTAVVQRVEIHKLRQGENLILGFSIGGGIDQDPSQNPF
SEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDMTM
VTHDQARKRLTKRSEEVVRLLVTRQSLQKAVQQSdpe (SEQ
ID NO:25)

D. GST Fusion Protein Production and Purification

The constructs using the pGEX-3X expression vector were used to make fusion
proteins according to the protocol outlined in the GST Fusion System, Second Edition,
Revision 2, Pharmacia Biotech, Method II, optimized for a 1 L LgPP.

Purified DNA was transformed into E. coli, and the cells were grown to an OD of
0.4-0.8 (measured at 600 nm). Protein expression was induced for 1-2 hours by addition of IPTG to cell
culture. Cells were harvested and lysed. Lysate was collected and GS4B beads (Pharmacia
Cat# 17-0756-01) were added to bind GST fusion proteins. Beads were isolated and GST
fusion proteins were eluted with GEB II. Purified proteins were stored in GEB II at -80°C.

Purified proteins were used for ELISA-based assays and antibody
production.

E. IgG Fusion Protein Production and Purification 

The constructs using the CD5gamma or Peak10IgG expression vectors were
used to make fusion protein. Purified DNA vectors were transfected into 293 EBNA T cells
under standard growth conditions (DMEM + 10% FCS) using standard calcium phosphate
precipitation methods (Sambrook, Fritsch and Maniatis, Cold Spring Harbor Press) at a ratio of
~1 ug vector DNA for 1 million cells. This vector results in a fusion protein that is secreted
into the growth medium. Transiently transfected cells are tested for peak expression, and
growth media containing fusion protein is collected at that maximum (usually 1-2 days). Fusion
proteins are either purified using Protein A chromatography or frozen directly in the growth
media without addition.



EXAMPLE 2

IDENTIFICATION OF INTERLEUKIN 8 RECEPTOR A (IL8RA) INTERACTIONS
WITH MAGI1 (domain 2), TIP1 (domain 1) and MINT2 (domains 1 & 2) IN VITRO

This example describes the binding of IL8RA to MAGI1 (domain 2 of 6), TIP1, and
Mint2 (domains 1 & 2), assessed using a modified ELISA. Briefly, a GST-PDZ fusion was
produced that contained the entire PDZ domain of human MAGI1 or TIP1 (see Example 2). In
the case of Mint2, domains 1 and 2, the GST-PDZ fusion contained the entire PDZ domain for
both domains 1 and 2. In addition, biotinylated peptide corresponding to the C-terminal 20
amino acids of IL8RA was synthesized and purified by HPLC. Binding between these entities
was detected through the "G" Assay, a colorimetric assay using avidin-HRP to bind the biotin
and a peroxidase substrate.

A. Peptide Purification

Peptide representing the C-terminal 20 amino acids of IL8RA was synthesized by
standard FMOC chemistry and biotinylated if not used as an unlabeled competitor. Peptide was
purified by reverse phase high performance liquid chromatography (HPLC) using a Vydac
218TP C18 Reversed Phase column having the dimensions of 10 x 25 mm, 5 um.
Approximately 40 mg of peptide was dissolved in 2.0 ml of an aqueous solution of 49.9%
acetonitrile and 0.1% trifluoroacetic acid (TFA). This solution was then injected into the
HPLC machine through a 25 micron syringe filter (Millipore). Buffers used to get a good
separation are (A) distilled water with 0.1% TFA and (B) 0.1% TFA with acetonitrile.
The gradient segment setup is listed in Table 7.



TABLE 7.

Time | A | B | C | Flow rate (ml/min)
0 | 96% | 4% | 0 | 5.00
30 | 100% | 100% | 0 | 5.00
35 | 100% | 100% | 0 | 5.00
40 | 96% | 4% | 0 | 5.00

The separation occurs based on the nature of the peptides. A peptide of overall hydrophobic
nature will elute later than a peptide of a hydrophilic nature. Fractions containing the
"pure" peptide were collected and checked by mass spectrometry (MS). Purified peptides are
lyophilized for stability and later use.

B. "G" Assay for Identification of Interactions Between Peptides and Fusion Proteins 

Reagents and Materials

• Nunc Polysorp 96 well Immuno-plate (Nunc cat#62409-005)
(Maxisorp plates have been shown to have higher background signal)

• PBS pH 7.4 (Gibco BRL cat#16777-148) or
AVC phosphate buffered saline, 8 gm NaCl, 0.29 gm KCl, 1.44 gm Na2HPO4,
0.24 gm KH2PO4, add H2O to 1 L and pH 7.4; 0.2 µm filter

• 2% BSA/PBS (10 gm of bovine serum albumin, fraction V (ICN Biomedicals
cat#IC15142983) into 500 ml PBS)

• Goat anti-GST mAb stock @ 5 mg/ml, store at 4°C, (Amersham Pharmacia
cat#27-4577-01), dilute 1:1000 in PBS, final concentration 5 µg/ml

• HRP-Streptavidin, 2.5 mg/2 ml stock stored at 4°C (Zymed cat#43-4323),
dilute 1:2000 into 2% BSA, final concentration at 0.5 µg/ml

• Wash Buffer, 0.2% Tween 20 in 50 mM Tris pH 8.0

• TMB ready to use (Dako cat#S1600)

• 1 M H2SO4

• 12w multichannel pipettor,

• 50 ml reagent reservoirs,

• 15 ml polypropylene conical tubes
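
The working dilutions in the reagent list follow from simple arithmetic, sketched below in Python. The function names are hypothetical, and the sketch assumes that a "1:N" dilution means one part stock in N parts total volume.

```python
def diluted_concentration_ug_per_ml(stock_mg_per_ml, dilution_factor):
    """Final concentration in ug/ml after a 1:dilution_factor dilution."""
    return stock_mg_per_ml * 1000.0 / dilution_factor


def stock_volume_ul(final_volume_ml, dilution_factor):
    """Volume of stock (in ul) needed to prepare final_volume_ml of working solution."""
    return final_volume_ml * 1000.0 / dilution_factor


# Goat anti-GST stock at 5 mg/ml, diluted 1:1000 in PBS.
print(diluted_concentration_ug_per_ml(5.0, 1000))

# Stock needed for 12 ml of a 1:2000 working solution.
print(stock_volume_ul(12.0, 2000))
```

The first call reproduces the 5 ug/ml coating concentration stated for the anti-GST antibody; the second gives the stock volume for one plate's worth of a 1:2000 dilution under the stated assumption.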

Protocol 

1) Coat plate with 100 ul of 5 ug/ml goat anti-GST, O/N @ 4°C

2) Dump coating antibodies out and tap dry

3) Blocking - Add 200 ul per well 2% BSA, 2 hrs at 4°C

4) Prepare proteins in 2% BSA
(2 ml per row or per two columns)

5) 3 washes with cold PBS (must be cold through entire experiment)
(at last wash leave PBS in wells until immediately adding next step)

6) Add proteins at 50 ul per well on ice (1 to 2 hrs at 4°C)

7) Prepare peptides in 2% BSA (2 ml/row or /columns)

8) 3 X wash with cold PBS

9) Add peptides at 50 ul per well on ice (time on / time off)
- keep on ice after last peptide has been added for 10 minutes exactly
- place at room temp for 20 minutes exactly

10) Prepare 12 ml/plate of HRP-Streptavidin (1:2000 dilution in 2% BSA)

11) 3 X wash with cold PBS

12) Add HRP-Streptavidin at 100 ul per well on ice, 20 minutes at 4°C

13) Turn on plate reader and prepare files

14) 5 X washes, avoid bubbles

15) Using gloves, add TMB substrate at 100 ul per well
- incubate in dark at room temp
- check plate periodically (5, 10, & 20 minutes)
- take early readings, if necessary, at 650 nm (blue)
- at 20 minutes, stop reaction with 100 ul of 1 M H2SO4
- take last reading at 450 nm (yellow)



C. Results of Binding Experiments

Results of peptides representing the carboxy-terminal 20 amino acids of IL8RA binding
to MAGI1, domain 2 of 6, TIP1, and Mint2, domains 1 and 2, are shown in Figure 1. Clearly,
IL8RA binds GST-MAGI1 domain 2 and GST-TIP1 with much higher affinity than it does
GST-Mint2 domains 1 & 2 at equivalent peptide concentrations and with equivalent amounts of
GST-PDZ fusion protein. Because the interaction between IL8RA and Mint2 is not
significantly higher than background, Mint2 PDZs may not interact with the IL8RA PL peptide
when tested in this assay.

D. Conclusions and Summary

MAGI1 (domain 2) and TIP1 bind to IL8RA better than Mint2 (domains 1 and 2) binds
to the same peptide.

The "G" Assay provides an accurate method for testing the binding of PDZ proteins to
PL peptides in vitro, and highlights the specificity of PDZ-PL pairing. The same peptide can
interact more or less strongly with different PDZs, and binding strength is not relative for the
same PDZ. However, binding affinity and binding patterns of PDZs and PLs are not
predictable, and binding profiles may change with assay variations and data interpretation.



EXAMPLE 3

IDENTIFICATION OF ALPHA ADRENERGIC RECEPTOR INTERACTIONS WITH
PDZ PROTEINS

This example describes the binding of a subset of alpha adrenergic receptors and PDZ
domains using the modified ELISA described in the previous example. Biotinylated peptides
corresponding to the C-terminal 20 amino acids of Alpha-2A adrenergic receptor
(HDFRRAFKKILARGDRKRIV; SEQ ID NO:26), Alpha-2B adrenergic receptor
(QDFRRAFRRILARPWTQTAW; SEQ ID NO:27), and Alpha-2C adrenergic receptor
(DFRPSFKHILFRRARRGFRQ; SEQ ID NO:28) were synthesized and purified by HPLC.
Binding between these entities was detected through the "G" Assay, a colorimetric assay using
avidin-HRP to bind the biotin and a peroxidase substrate.
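
Taking the C-terminal residues of a sequence, as done for the peptides above, can be sketched in a few lines of Python. The helper name is hypothetical. Because the full-length receptor sequences are not reproduced here, the sketch uses the three 20-residue peptides themselves as input and also prints their last four residues, the region PDZ domains typically recognize.

```python
def c_terminal(sequence, n=20):
    """Return the last n residues of a one-letter amino acid sequence."""
    residues = "".join(sequence.split()).upper()
    if len(residues) < n:
        raise ValueError("sequence is shorter than the requested length")
    return residues[-n:]


# Peptide sequences as given for SEQ ID NO:26, 27 and 28.
peptides = {
    "Alpha-2A": "HDFRRAFKKILARGDRKRIV",
    "Alpha-2B": "QDFRRAFRRILARPWTQTAW",
    "Alpha-2C": "DFRPSFKHILFRRARRGFRQ",
}

for receptor, peptide in peptides.items():
    print(receptor, len(c_terminal(peptide)), c_terminal(peptide, 4))
```

Applied to a full-length receptor sequence, the same call with the default n would yield the 20-residue peptide used in the assay.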

Tables 8A, 8B and 8C show the results of G assay testing (described supra) between the three
alpha 2 adrenergic subunits and a subset of PDZ domains. All tests are performed at 10 uM
concentration of peptide, and the peptide sequence is displayed in column 2. The background
binding is somewhat high for these peptides (average OD), and a reduced number of
interactions would be seen with lower peptide concentrations. Duplicate rows of PDZ GENE
NAME and DOMAIN indicate independent sets of duplicates. A '0' in the average OD column
indicates failure of the test. Table 8A shows results for Alpha-2A adrenergic receptor (SEQ
ID NO:26), Table 8B shows results for Alpha-2B adrenergic receptor (SEQ ID NO:27), and
Table 8C shows results for Alpha-2C adrenergic receptor (SEQ ID NO:28).

Table 8A:



Alpha-2 A adrenergic receptor 



GENE NAME 


DOMAIN 


AVERAGE OD 


AF6 


1 


0.3325 


AF6 


1 


0.5125 


AIPC 


1 


2.594 


AIPC 


1 


1.93 


AIPC 


3 


0.1465 


AIPC 


3 


0.2165 


AIPC 


4 


0.197 


AIPC 


4 


0.2285 


APXL1 




1.194 


APXL1 




0.5545 


CARD 14 




1.06 


CARD 14 




0.6535 


CASK 




0.2825 


CASK 




0.475 


CNK1 




0.275 


CNK1 




0.4505 


CYTOHESIN BINDING 
PROTEIN 




0.2515 


CYTOHESIN BINDING 
PROTEIN 




0.38 


DLG1 




0.142 


DLG1 




0.204 


DLG1 


2 


0.1665 


DLG1 


2 


0.272 


DLG1 


3 


0.2415 


DLG1 


3 


0.5315 


DLG1 


1,2 


0.2435 


DLG1 


1.2 


0.3955 
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GENE NAME 


DOMAIN 


AVERAGE OD 


DLG2 


1 


0.1185 


DLG2 


1 


0.2255 


DLG2 


2 


0.208 


DLG2 


2 


0.3005 


DLG5 


1 


0.1955 


DLG5 


1 


0.168 


DLG5 


2 


0.3655 


DLG5 


2 


0.6325 


DLG5 


2 


0.648 


DLG5 


2 


0.474 


DVL2 


1 . 


0.294 


DVL2 


1 


0.4565 


DVL3 


1 


0.4915 


DVL3 




0.8465 


EBP50 




0.406 


! EBP50 




0.1385 


EBP50 




0.2395 


EBP50 




0.139 


EBP50 


1,2 


0.2515 


EBP50 


1,2 


0.1295 


ENIGMA 


1 


0.3955 


ENIGMA 


1 


0.144 


ERBIN 


1 


0.2285 


ERBIN 




0.451 


FLJ00011 




0.2725 


FLJ0001 1 


1 


0.402 


FLJ11215 


1 


0.141 


FLJ11215 


1 


0.2065 


FLJ12615 


1 


0.157 


FLJ12615 


1 


0.26 


FLJ21687 




0.9965 


FLJ21687 


1 


0.8225 


GRIP 1 


4 


0.402 


GRIP 1 


4 


0.339 


GRIP 1 


5 


0.405 


! GRIP 1 


5 


0.3185 


GRIP 1 


6 


0.3795 


GRIP 1 


6 


0.177 


GRIP 1 


7 


0.26 


GRIP 1 


7 


0.187 


HEMBA 1003117 


1 


0.558 


HEMBA 1003117 


1 


0.415 


HEMBA 1003117 


1 


0.5875 


HEMBA 1003117 


1 


0.8515 


INADL 


1 


0.336 


INADL 


1 


0.5975 


INADL 


3 


1.095 


INADL 


3 


2.1295 


INADL 


4 


0.6395 


i INADL 


4 


1.049 


INADL 


5 


0.2175 


INADL 


5 


0.3455 


INADL 


7 


0.372 


INADL 


7 


0.5995 


INADL 


8 


0.2785 


INADL 


8 


0.47 


KIAA0316 


1 


0.1965 


KIAA0316 


1 


0.18 


| KIAA0340 


1 


0.855 


K1AA0340 


1 


1.224 
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f GENE NAME 


DOMAIN 


AVERAGE OD 


KIAA0380 




2.061 


KIAA0380 


1 


2.5805 


KIAA0382 


1 


0.2085 


KIAA0382 




0.3865 


KIAA0440 


1 


1.176 


KIAA0440 




0.733 


KIAA0559 


1 


0.2355 


KIAA0559 




0.3155 


KIAA0751 




0.667 


KIAA0751 




1.1525 


KIAA0751 




3.4115 


j KIAA0751 




2.67 


KIAA0858 




0.23 


KIAA0858 


1 


0.3835 


KIAA0967 


1 


0.2555 


KIAA0967 




0.1555 


KIAA1095 




0.2225 


| KIAA1095 




0.328 


KIAA1095 




0.2635 


KIAA1095 




0.3465 


KIAA1222 


1 


0.3325 


KIAA1222 




0.2375 


KIAA1284 




0.8405 


KIAA1284 


1 


0.845 


KIAA1415 




0.3215 


KIAA1415 




0.3045 


KIAA1526 




0.209 


KIAA1526 


-j 


0.3675 


KIAA1526 




3.8915 


KIAA1526 




4 


KIAA1526 


2 


0.8305 


KIAA1526 


2 


1.511 


KIAA1526 


2 


0.2085 


KIAA1526 


2 


0.4095 


KIAA1620 


1 


0.231 


KIAA1620 


1 


0.152 


KIAA1719 


1 


0.2835 


KIAA1719 


1 


0.1895 


| KIAA1719 


2 


0.2545 


KIAA1719 


2 


0.203 


KIAA1719 


3 


0.338 


KIAA1719 


3 


0.2555 


j KIAA1719 


4 


2.4485 


KIAA1719 


4 


2.433 


KIAA1719 


5 


0.417 


KIAA1719 


5 


0.356 


| KIAA1719 


6 


0.264 


| KIAA1719 


6 


0.1695 


LIM MYSTIQUE 




0.8755 


LIM MYSTIQUE 




0.8705 


I LIM PROTEIN 


1 


- 0.5305 


I LIM PROTEIN 


1 


0.732 


! LIM-RIL 


1 


0.407 


I LIM-RIL 




0.4955 


\ LIMK1 




0.354 


I LIMK1 




0.3655 


LIMK2 




0.344 


LIMK2 




0.4015 


LU-1 




0.2425 


LU-1 




0.19 
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GENE NAME 


DOMAIN 


AVERAGE OD 


MAGI 1 


1 


0.247 


MAGI 1 


1 


0.365 


MAGI 1 


1 


0.3645 


MAGI 1 


1 


0.4925 


MAGI 1 


3 


0.2915 


! MAGI 1 


3 


0.4715 


\ MAGI 1 


3 


2.564 


MAGI 1 


3 


3664 


MAGI 1 


4 


0.3085 


MAGI 1 


4 


0.4115 


MAGI 1 


5 


0.245 


MAGI 1 


5 . 


0.3925 


MAGI 2 


1 


0.2595 


MAGI 2 


1 


0.1815 


MAGI 2 


2 


0.205 


MAGI 2 


2 


0.136 


MAGI 2 


3 


0.2925 


MAGI 2 


3 


0.1885 


MAGI 2 


4 


0.144 


. MAGI 2 


4 


0.18 


MAGI 2 


5 


0.7415 


MAGI 2 


5 


0.8035 


MAGI 2 


6 


0.763 


MAGI 2 


6 


0.9055 


MAGI 3 


1 


0.272 


MAGI 3 


1 


0.499 


MAGI 3 


2 


0.701 


MAGI 3 


2 


1.192 


MAGI 3 


3 


0.243 


MAGI 3 


3 


0.566 


MAGI 3 


4 


0.2545 


MAGI 3 


4 


0.4775 


MAGI 3 


5 


0.2745 


! MAGI 3 


5 


0.5265 


MAST1 


1 


0.4675 


MAST1 


1 


0.355 


MAST2 


1 


0.6125 


MAST2 


1 


0.5255 


MAST2 




0.98 


MAST2 




1.7505 


MAST4 


1 


0.264 


MAST4 


1 


0.3355 


MINT1 


1 


1.0045 


MINT1 


1 


0.781 


MINT1 


2 


0.299 


MINT1 


2 


0.1895 


MINT1 


1,2 


3.184 


MINT1 


1,2 


3.8385 


MPP1 


1 


0.479 


I MPP1 


1 


0.685 


MPP2 


1 


0.464 


MPP2 


1 


0.318 


MUPP1 


1 


0.4445 


MUPP1 




0.7405 


MUPP1 


2 


0.4995 


MUPP1 


2 


0.5935 


MUPP1 


3 


0.4815 


MUPP1 


3 


0.742 


i! MUPP1 


4 


1.08 


| MUPP1 


4 


1.923 
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GENE NAME 


DOMAIN 


AVERAGE OD 


i MUPP1 


5 


0.3005 


MUPP1 


5 


0.706 


MUPP1 


6 


1.1875 


MUPP1 


6 


1.909 


MUPP1 


7 


0.377 


MUPP1 


7 


0.676 


MUPP1 


8 


0.835 


MUPP1 


8 


1.5405 


MUPP1 


9 


0.2845 


MUPP1 


9 


0.5165 


MUPP1 


10 


0.3165 


MUPP1 


10 


0.514 


MUPP1 


11 


0.309 


MUPP1 


11 


0.6785 


MUPP1 


12 


0.23 


! MUPP1 


12 


0.3145 


MUPP1 


13 


0.5555 


MUPP1 


13 


0.842 


NEDLG 


1 


0.2175 


NEDLG 


1 


0.143 


NEDLG 


2 


0.159 


j NEDLG 


2 


0.2355 


NEDLG


3 


0.137 


NEDLG 


3 


0.2555 


NEDLG 


1,2 


0.3165 


NEDLG 


1,2 


0.401 


NOS1


1 


0.7285 


NOS1


1 


0.96 


NOVEL PDZ GENE 


1 


0.8105 


NOVEL PDZ GENE 


1 


2.973 


NOVEL PDZ GENE 


2 


0.363 


NOVEL PDZ GENE


2


0.844 


OUTER MEMBRANE 


1 


0.21 


OUTER MEMBRANE 


1 


0.4655 


P55T 


1 


0.236 


P55T 


1 


0.1785 


PAR3 


2 


0.2675 


PAR3


2 


0.2085 


PAR3 


3 


1.451 


PAR3 


3 


1.2735 


PAR6 


1 


0.381 


PAR6 


1 


0.568 


PAR6 GAMMA 


1 


0.2065 


PAR6 GAMMA


1 


0.2425 


PDZ-73 


2 


0.251 


PDZ-73 


2 


0.4365 


PDZ-73 


3 


0.2225 


PDZ-73


3 


0.369 


PDZK1 


1 


0.3415 


PDZK1


1 


0.608 


PDZK1 


2 


0.29 


PDZK1 


2 


0.4915 


PDZK1 


3 


0.5655 


PDZK1 


3 


0.5355 


PDZK1 


4 


0.199 


PDZK1 


4 


0.2365 


PDZK1 


2,3,4 


0.441 


PDZK1


2,3,4 


0.5115 


PICK1 


1 


0.535 


PICK1 


1 


0.769


PIST 


1 


0.144 


PIST 


1 


0.35 


PRIL16


1 


0.3105 


PRIL16 


1 


0.292 


PRIL16 


2 


0.2165 


PRIL16 


2 


0.173 


PRIL16


1,2 


0.5495 


PRIL16


1,2 


0.6355 


PSD95 


1 


0.2165 


PSD95


1 


0.115 


PSD95 


3 


0.161 


PSD95 


3 


0.1085 


PSD95 


1,2,3 


0.341 


PSD95 


1,2,3 


0.4045 


PTN-4 


1


0.384 


PTN-4 


1 


0.425 


PTPL1 


1 


0.2225 


PTPL1 


1 


0.163 


PTPL1 


2 


1.6145 


PTPL1 


2 


1.452 


PTPL1 


3 


0.1595 


PTPL1 


3 


0.1905 


PTPL1 


4 


0.265 


PTPL1 


4 


0.4135 


PTPL1 


5 


0.1895 


PTPL1 


5 


0.3 


SHANK 1 


1 


0.281 


SHANK 1 


1 


0.2205 


SIP1 




0.332 


SIP1




0.205 


SITAC18




3.8915 


SITAC18 


1 


3.297 


SITAC18




3.8365 


SITAC18 




4 


SYNTROPHIN 1 ALPHA 


1 


1.11 


SYNTROPHIN 1 ALPHA 


1 


1.78 


SYNTROPHIN GAMMA 1 


1 


1.2705 


SYNTROPHIN GAMMA 1 


1 


1.126 


SYNTROPHIN GAMMA 2 




0.265 


SYNTROPHIN GAMMA 2


1 


0.156 


TAX2-LIKE PROTEIN 


1 


0.2445 


TAX2-LIKE PROTEIN


1 


0.558 


TIAM1 


1 


0.3445 


TIAM1 


1 


0.435 


TIAM2 




0.2445 


TIAM2




0.378 


TIP1 


1 


0.802 


TIP1 


1 


1.309 


TIP2 


1 


0.4165 


TIP2 


1 


0.6065 


VARTUL 


1 


0.287 


VARTUL 


1 


0.1525 


VARTUL 


2 


0.3335 


VARTUL 


2 


0.2375 


VARTUL 


3 


0.2985 


VARTUL 


3 


0.1235 


VARTUL 


4 


0.302 


VARTUL 


4 


0.1805 


VARTUL 


1,2 


0.3665 


VARTUL 


1,2 


0.555


X-11 BETA


1 


1.3435 


X-11 BETA 


1 


1.0755 


X-11 BETA


2 


0.5205 


X-11 BETA 


2 


0.345 


X-11 BETA 


1,2 


2.63 


X-11 BETA


1,2 


3.6965 


ZO-1 


1 


3.758 


ZO-1 


2 


3.0035 


ZO-1 


2 


3.2305 


ZO-1 


3 


0.3305 


ZO-1 


3 


0.7565 


ZO-2 


1 


0.5655 


ZO-2 


1 


0.4095 


ZO-2 


2 


1.3775 


ZO-2


2 


1.5355 


ZO-2


3 


0.1415 


ZO-2


3 


0.2935 


ZO-3 


1 


0.578 


ZO-3


1 


0.746


ZO-3 


2 


2.5585 


ZO-3 


2 


3.245 


ZO-3 


3 


0.2365 


ZO-3 


3 


0.4715 


Table 8B: Alpha-2B adrenergic receptor 


GENE NAME


DOMAIN


AVERAGE OD


AF6




1.988 


AF6




2.387 


AF6




2.233 


AIPC 




1.5395 


AIPC 




0.576 


AIPC




1.028 


AIPC




1.7515 


AIPC 


3 


0.404 


AIPC


3 


0.788 


AIPC 


4 


1.117 


AIPC


4 


0.508 


ALP 




0.953 


ALP 




1.3375


APXL1 




2.005 


APXL1 




0.979 


CARD14 




1.8785 


CARD14 




1.144 


CASK 




2.2245 


CASK 




1.905 


CASK 




2.139 


CNK1 




1.3535 


CNK1 




0.8095 


Cytohesin binding Protein 




1.968 


Cytohesin binding Protein 




2.1155 


Cytohesin binding Protein 




1.878


DLG1 


1


i .4y 


DLG1 


1 




DLG1 


2 


1.597


DLG1 


2 


1 a one 


DLG1 


3 


1.14


DLG1 


3 


2.0895


DLG1 


1,2


2.083 


DLG1 


1,2


O /17Q.C 

2.4f o5 


DLG1 


1,2


2.1545


DLG2 


1 


a cca c 
0.OD45 


DLG2 


1 


A OOC 
0.000 


DLG2 


2 


0.7655


DLG2 


2 


1.3695


DLG5 


1 


1.0645


DLG5


1 


0.6255


DLG5 


2 


2.2525


DLG5 


2


2.822


DLG5 


2


2.4085 


DLG5 


2 


1.1375


DLG5 


2 


0.568 


DVL2 


1 


1.1125


DVL2 


1 


1.962 


DVL3 


1 


2.5155 


DVL3


1 


2.0525 


EBP50 


1 


0.7175


EBP50 


1 


1.3475


EBP50 




0.6575 


EBP50 




1.14 


EBP50 


1,2 


1.14


EBP50 


1,2


0.6035


ENIGMA 


1 


0.8495


ENIGMA 


1 


1.5175


ERBIN


1 


0.7835


ERBIN 


1 


1.4045


FLJ00011


1


0.6075


FLJ00011


1


1.2535


FLJ11215


1


1.1605


FLJ11215


1


0.5095


m ici c 
rl_J12ol 0 


1 


A CAAC 

O.oOUo 


r— 1 1 a oci c 
rLJIzbiO 


1 


A f\A O 

1 .01 0 


FLJ21687 


1 


1.204


FLJ21687




0.628


GRIP 1 


4 


0.5325 


GRIP 1 


4 


2.5575 


GRIP 1 


5 


0.6365 


GRIP 1 


5 


0.9375 


GRIP 1


6 


1.519 


GRIP 1 


6 


0.993


GRIP 1 


■7 

( 


A *7"7/1 c 

. U. / /40 


GRIP 1


■7 
/ 


A DO 
U.OO 


Guanine exchange factor 


1 


A CQ 


Guanine exchange factor 


1 


4 once 
1 .2UbO 


HEMBA 1003117


1


1.3575


HEMBA 1003117


1 


A C A C 

0.54b 


HEMBA 1003117


1 


A TO AC 

0. /oUb 


HEMBA 1003117


1


1.432


INADL 


1 


"1 4 AC 

1 .19b 


INADL




1.2095


INADL


2 


1.2635


INADL 


2 


1.2545


INADL


3 


2.2165 


INADL 


3 


1.3695


INADL 


4


1./99 


INADL


4


1.082


INADL


5 


O "1 CO 

2.1b9 


INADL


5


1.646


INADL


7


1.925 


INADL


7 


1.331 


INADL


Q 
O 


2.00/0 


INADL 


8


O X AO C 


KIAA0316


1 


1.1905 


KIAA0316


1


0.6525


KIAA0340 


1 


0.606


KIAA0340 


1 


1.175 


KIAA0380


1


2.442


KIAA0380 


1 


1.8915


KIAA0380


1 


O 7*3 4 

2./31 


KIAA0382


1 


A C"7X C 


KIAA0382


1 


AAA 7C 

1 .11 / b 


KIAA0440


1 


O CTM C 

2.b/1 b 


KIAA0440


1


1.7615


KIAA0440


1


2.9815


KIAA0559


1


1.3815


KIAA0559


1 


4 C"7"7 
1 .6/ / 


KIAA0751


1 


4 CftOC 

1 .b93o 


KIAA0751


1 


2.14/0 


KIAA0751


1


1.480


IX 1 A AflOCO 


1 


4 7COC 

1 . /bob 


KIAA0858


1


1.134


KIAA0967




0.504


KIAA0967 




0.869 


KIAA1095 




1.5 


KIAA1095 




0.8115 


KIAA1222 




0.9555 


KIAA1222 




0.57 


KIAA1284 




0.5985


k"IAA19Q/l 
MAA1zo4 




1 ^7 


MAA (41 0 




n ^Qfi 
u.oyo 


1 A A 1 A 1 C. 
r\IAA141 0 


1 


£.0000 




1 


u.oooo 


MAAl 0^0 


1 


1 A&? 


i/|A A1 coe 
MAA1 0^0 


1 


1 ^90^ 


K\ A A1 OR 




u.yo i 


! k'lAAIc.OC 


4 


n race 
u.oooo 


] l/IA Ai^OC 
j 04O 


4 


1 .OOY 0 


If IA A1 COC 




U.OU4 


l/IA A1 COfi 

mAAl 040 


4 


U.004 


|/|A A1 con 
r\IAA1 04U 


■1 
1 


U.O/ 0 


am con 


<1 

1 


1 Qoot; 

i .yo4o 


I/IAA1 71 Q 

maa i / i y 


1 


U.O 1 40 


I A A1 71 Q 

maa i / 1 y 


1 


1 ^ 

1 .0 


tfl A A 1 71 O 

r\iAAi/iy 


4 


1 //ft 

1 .44o 


! 1 A A 1 71 Q 

MAA1 / iy 


4 


u.oyoo 


l^l A A 1 71 Q 

MAA1 ( i y 


' o 
O 


q cone 

O.OoUO 


i/|A A1 71 Q 

MAA1 ( \ y 


O 


z.ol o 


If I A A1 71 Q 

r\iAAi /i y 


4 


U.O^O 


Lf\ a A 1 71 Q 

maai / 1 y 


4 


1 no/ 
i .uy4 


If 1 A A 1 71 Q 
MAA1 / 1 y 


c 
0 


U.OOOO 


If 1 A A 1 71 Q 

isiaai / 1 y 


c 

0 


1 .0000 


|/|A A1 71 Q 

MAA1 f i y 


O 


U.oO 


1/'1AA171Q 

MAA1 fiy 


O 


1 71 *3 


LIM mystique 




U.OOOO 


LIM mystique 


1 


n 7HC 
U. f Uu 


LIM protein 




1 ft/ 7 


LIM protein 


1 


o ino 


LIM-KIL 




1 oi m 
i .yi i o 


j LIM-KIL 




111 fit; 


LIMrvl j 




1 GE1 C 
1 .DOT 0 


LIMfx 1 




1 7Q***% 


LllVlfVZ 




z.yoo 


Liivjr\z 




9 1QR 

z.iyo 


1111 
LLM 




n 71 ft 


I I i 1 
LU- I 




U.OZ f 0 


MAf^l 1 




I .0000 


MAf^l 1 




n ofiR<; 
u.yooo 






Q C1 DC 

o.O I 00 






O.Zol 


maoi 1 


o 
o 


1 .ooo 


MAGI 1 


3 


1.2295


MAGI 1 


3 


1.4925 


MAGI 1


ZO-3 


2 


3.742 


ZO-3 


2 


0 


ZO-3 


3 


3.4125 


ZO-3 


3 


0 


Table 8C: Alpha-2C adrenergic receptor 


GENE NAME 


DOMAIN 


AVERAGE OD 


AF6 


1 


1.943 


AF6 


1 


1.7465 


AIPC 


1 


1.6195 


AIPC 


1 


2.454 


AIPC 


1 


3.4005 


AIPC 


1 


2.5865 


AIPC 


3 


1.714 


AIPC 


3 


1.3765 


AIPC


4 


1.8395 


AIPC


4 


1.6645 


ALP 


1 


3.093 


ALP 


1 


1.8765 


APXL1


1 


2.002 


APXL1 




3.4065 


CARD14 




4 


CARD14




4.0725 


CASK 




2.113 


CASK 




1.8105 


CNK1 


1 


1.862 


CNK1 


1 


2.769 


Cytohesin binding Protein 


1 


2.343 


Cytohesin binding Protein 


1 


2.0315 


DLG1 


1 


1.0915 


DLG1 


1 


1.677 


DLG1 


2 


1.8005 


DLG1 
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1 .2895 


DLG1 
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1 .9495 


DLG1 
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DLG1 
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DLG1 
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2.2305 


DLG5 


1


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6 


2.28 


MUPP1 


6 


2.3995 


MUPP1 


7 


1.871 


MUPP1 


7 


1.8225


MUPP1 


8 


1.5675 


MUPP1 


8 


1.779 


MUPP1 


9 


1.804


MUPP1 


9 


1.8075


MUPP1 


10 


1.8495


MUPP1 


10 


1.9885


MUPP1 


11 


1.456


MUPP1 


11


1.856 


MUPP1 


12 


1.236 


MUPP1 


12 


1.7415


MUPP1




1.377 


MUPP1 


13 


2.7545 


NeDLG 


1 


1.0965 


NeDLG 


1 


1.819 


NeDLG 


2 


1.2775 


NeDLG 


2 


1.89


NeDLG 


3 


oonc 
1 .ZZVD 


NeDLG 


3 


2.24U5 


NeDLG 


1,2 


1.8635


NeDLG 


1,2 


1.902


NOS1


1 


o none 
O.02U5 


NOS1


1 


2.0945 


novel PDZ gene 


1 


2.3495 


novel PDZ gene 


1 


3.235 


novel PDZ gene 


2 


2.3155 


novel PDZ gene


2 


o nooc 

2.yooo 


outer membrane


1 


1.2725


outer membrane 


1 


2.043


p55T 


1 


1.3015 


p55T 


1 


1.561 


PAR3


3 


1.1185 


PAR3


3 


1.4575


PAR6 


1 


2.567


PAR6 


1 


2.997


PAR6 GAMMA 


1 


1.345 


PAR6 GAMMA 


1 


1.0305


PDZ-73 


2 


1.965 


PDZ-73 


2 


3.1455


PDZK1 


1 


1.4275


PDZK1 


1 


1.9785


PDZK1 


2 


1.049 


PDZK1 


2 


1.876 


PDZK1 


3 


1.4145


PDZK1 


3 


1.9415 


PDZK1 


4 


1.6115 


PDZK1 


4 


1.913 


PDZK1 


2,3,4 


1.8195


PDZK1 


2,3,4 


1.8 


PICK1


1 


2.2435 


PICK1


1 


3.094


PIST


1


1.005


PIST


1


1.2995


prIL16


1 


A vl <1 O 
1 .41 O 


prIL16




1.0525


prIL16


2


1.306


prIL16


2


1.0315


prIL16


1,2


1.6955


prIL16


1,2


o ceo 


PSD95


1


1.2595


PSD95


1 


2.0535 


PSD95 


3 


1.191 


PSD95 


3 


1.718 


PSD95 


1,2,3 


2.4695 


PSD95 


1,2,3 


3.968 





PTN-4


1


1 .OfO 


PTN-4


1


2.8045


PTPL1


1


0.8135


PTPL1


1


1.1115


PTPL1


2


1.378


PTPL1 


2 


2.249 


PTPL1


3 


0.5945 


PTPL1


3


2.1675


PTPL1 


4 


1.0465 


PTPL1 


4 


1.851


PTPL1 


5 


3.292 


PTPL1 


5 


2.0565 


Shank 1 


1 


1.9


Shank 1 


1 


1.656 


SIP1


2


1.5845


SIP1 


2 


2.137 


SITAC 18 


1 


1.5095


SITAC 18


1


4.088


SITAC 18


2 


2.0615 


SITAC 18


2 


4 


SYNTENIN 


1 


1.3545


SYNTENIN 


1 


2.2475


SYNTENIN


2


1.36


SYNTENIN 


2 


2.5975 


Syntrophin 1 alpha 


1 


2.5625 


Syntrophin 1 alpha 


1 


3.513 


Syntrophin gamma 1 


1 


1.5995 


Syntrophin gamma 1 


1


1.551


Syntrophin gamma 2 


1 


A AO A 

1 .Ozl 


Syntrophin gamma 2 


1 


1.1855


TAX2-like protein


1


2.065


TAX2-like protein 


1 


3.268 


TIAM1 


1 


2.519 


TIAM1


1 


3.0965 


TIAM2


1


1.5895


TIAM2


1


1.908


TIP1


1 


2.941


TIP1


1


O C07C 


TIP2


1 


O A O 4 


TIP2 


1 




Vartul 


1


1.0885


Vartul


1


1.454


Vartul


2 


1.663 


Vartul 


2 


1.0935 


Vartul 


3 


1.174 


Vartul 


3 


2.1 


Vartul 


4 


1.24 


Vartul 


4 


1.8935


vartui 


a 0 
1,Z 


i .byo 


\ In r+i il 

vanui 


i,z 


A CD7C 


x-n oeta 


1 


Z.OOOO 


V A A kn(n 

! x-n oeta 


1 


Z.lzo 


x-n oeta 


0 

z 


AAA 7C 

1 .41 (0 


i x-n beta 


0 

z 


a n 
1 .ol 


i V AA kft+n 

1 x-n beta 


1,2 


A A QIC 

1 .looO 


a- 11 beta 


1 ,2 


A ~7A QC 

1 . ( 1o0 


~fr\ a 


1 


«i nice 

1 .yobo 


ZU-1 


1 


A ~7A C 

l.no 


ZO-1 


z 


O A*} ACL 

z.4z4o 


| ZO-1 


z 


z.blz 


~7 r°\ a 
ZU-1 


0 

0 


■1 700 
l./oo 


ZU-1 




i .izy 


zu-z 


1 


A 007C 
1 .ZZ/ 0 


zu-z 


A 
1 


■1/1-1 oc 

1 .41 ZO 


zu-z 


z 


O A A A 
Z.141 


ZU-z 


0 
z 


1 .ob/ O 


ZU-z 


Q 
0 


z.lU/ 


ZU-Z 


Q 
O 


i .zy4 


ZO-3


1 


1.637 


ZO-3 


1 


2.612 


ZO-3 


2 


1.418 


ZO-3


2 


2.376 


ZO-3 


3 


1.2515 


ZO-3


3 


1.6585 



Table 9: Disorders/biological processes demonstrated to be affected by alpha adrenergic modulation

Receptor    Disorder/process
A1          Depression
A1          Lower Urinary Tract Storage
A1          Migraine
A1          Prostate apoptosis
A1          Hypertrophy, proliferation and migration of vascular smooth muscle after carotid injury
A2          Migraine
A2          Coronary Flow Reserve following stenting
A2          Alzheimer's
A2          Parkinson's
A2          Neuroprotection
A2          Glaucoma
A2          Opioid withdrawal



Conclusions and Summary 

Tables 8A, 8B and 8C are, to our knowledge, the first demonstrations of alpha 2
adrenergic receptor (A2R) interactions with PDZ domains. Alpha 1 adrenergic receptors (A1R)
also contain PL sequences at their C-termini, but these differ from those of A2Rs, implying
binding to a different subset of PDZ domains. A single report demonstrated an interaction
between the alpha 1A receptor and NOS1 (a PDZ protein; Pupo et al. (2002) BMC Pharmacology 2:17),
but the authors demonstrated that this interaction was not dependent on the PL of the A1A
adrenergic receptor. Without intending to be limiting, blocking interactions between alpha
adrenergic receptors and PDZ domains can modulate the effect of signaling through these
receptors and provide a new set of therapeutic targets for treatment of diseases or disorders
stemming from malfunctioning biological processes such as those listed in Table 9.

***

The present invention is not to be limited in scope by the exemplified 
embodiments which are intended as illustrations of single aspects of the invention and any 
sequences which are functionally equivalent are within the scope of the invention. Indeed, 
various modifications of the invention in addition to those shown and described herein will 
become apparent to those skilled in the art from the foregoing description and accompanying
drawings. Such modifications are intended to fall within the scope of the appended claims. 

All publications cited herein are incorporated by reference in their entirety and 
for all purposes. 
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Table 2. 



PL AVC 
ID 


PL 


PL 20Mer Sequence 


PDZ 


PDZ 
Domain 


Binding
Strength


AA250 


5HT3A (serotonin receptor 5- 
hydroxytryptamine 3A) 


LAVLAYSITLVMLWSIWQYA
(SEQ ID NO:29) 


HEMBA 1003117 


1 


2 


AA250 


5HT3A (serotonin receptor 5-
hydroxytryptamine 3A)


LAVLAYSITLVMLWSIWQYA
(SEQ ID NO:29) 


CARD14


1 


2 


AA250 


5HT3A (serotonin receptor 5- 
hydroxytryptamine 3A) 


LAVLAYSITLVMLWSIWQYA 
(SEQ ID NO:29) 


MPP2 


1 


2 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


MAGI 1 


3 


4 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


HEMBA 1003117 


1 


2 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


HEMBA 1003117


1 


1 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


KIAA0316 


1 


1 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


KIAA0807 


1 


1 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


KIAA1634 


2 


5 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


KIAA0807 


1 


1 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


Mint1


2 


1 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


MINT1 


1,2 


1 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


PTPL-1 


2 


5 


AA233L 


5HT2B (serotonin receptor 5- 
hydroxytryptamine 2B) 


DTLLLTENEGDKTEEQVSYV 
(SEQ ID NO:30) 


PTPL-1 


4 


2 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


Magi2 


6 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


MAGI 1


6 


3 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


CARD14


1 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


HEMBA 1003117 


1 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


FLJ21687 


1 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


APXL1 


1 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


HEMBA 1003117 


1 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


INADL 


3 


2 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


INADL 


4 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


KIAA0340 


1 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31)


KIAA0751 


1 


3 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


KIAA0807 


1 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31)


KIAA1284 


1 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31) 


KIAA1526 


1 


4 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


KIAA1719


4 


3 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31)


LIM-Mystique 


1 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31)


Mintl 


1 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31)


MUPP1 


6 


2 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31)


MUPP1 


8 


1 


AA243 


a2A-AR (modified) (adrenergic receptor 
alpha-2A) 


HDFRRAFKKILARGDRKRIV 
(SEQ ID NO:31)


MUPP1 


13 


1 
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AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


PAR3 


3 


2 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


PTPL-1 


2 


2 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


SITAC-18 


1 


4 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


SITAC-18 


2 


4 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


KIAA1526 


2 


1 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


X11-beta 


1 


2 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


X11-beta 


2 


1 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


ZO-1 


2 


4 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


ZO-2 


2 


2 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


ZO-3 


2 


3 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


DLG5 


2 


1 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


AIPC 


1 


2 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


Syntrophin gamma-1 


1 


2 


AA243 


a2A-AR (modified) (adrenergic receptor
alpha-2A)


HDFRRAFKKILARGDRKRIV
(SEQ ID NO:31)


Magi2 


5 


1 


AA244 


a2B-AR (modified) (adrenergic receptor
alpha-2B)


QDFRRAFRRILARPWTQTAW
(SEQ ID NO:32)


PSD95 


1,2,3 


5 


AA244 


a2B-AR (modified) (adrenergic receptor
alpha-2B)


QDFRRAFRRILARPWTQTAW
(SEQ ID NO:32)


TIP1 


1 


5 


AA244 


a2B-AR (modified) (adrenergic receptor
alpha-2B)


QDFRRAFRRILARPWTQTAW
(SEQ ID NO:32)


KIAA0807 


1 


4 


AA244 


a2B-AR (modified) (adrenergic receptor
alpha-2B)


QDFRRAFRRILARPWTQTAW
(SEQ ID NO:32)


KIAA0303 


1 


4 


AA244 


a2B-AR (modified) (adrenergic receptor
alpha-2B)


QDFRRAFRRILARPWTQTAW
(SEQ ID NO:32)


MAGI 1 


2 


4 


AA244 


a2B-AR (modified) (adrenergic receptor
alpha-2B)


QDFRRAFRRILARPWTQTAW
(SEQ ID NO:32)


MAGI 1 


4


5 


AA245 


a2C-AR (modified) (adrenergic receptor
alpha-2C)


DFRPSFKHILFRRARRGFRQ
(SEQ ID NO:33)


GRIP1 


5 


1 


AA245 


a2C-AR (modified) (adrenergic receptor
alpha-2C)


DFRPSFKHILFRRARRGFRQ
(SEQ ID NO:33)


LU1 


1 


4 


AA245 


a2C-AR (modified) (adrenergic receptor
alpha-2C)


DFRPSFKHILFRRARRGFRQ
(SEQ ID NO:33)


PTPL-1 


5 


3 


AA245 


a2C-AR (modified) (adrenergic receptor
alpha-2C)


DFRPSFKHILFRRARRGFRQ
(SEQ ID NO:33)


APXL1 


1 


3 


AA245 


a2C-AR (modified) (adrenergic receptor
alpha-2C)


DFRPSFKHILFRRARRGFRQ
(SEQ ID NO:33)


KIAA1719 


3 


5 


AA245 


a2C-AR (modified) (adrenergic receptor
alpha-2C)


DFRPSFKHILFRRARRGFRQ
(SEQ ID NO:33)


Mint 1 


2 


3 


AA245 


a2C-AR (modified) (adrenergic receptor
alpha-2C)


DFRPSFKHILFRRARRGFRQ
(SEQ ID NO:33)


MUPP1 


4 


3 


AA245 


a2C-AR (modified) (adrenergic receptor
alpha-2C)


DFRPSFKHILFRRARRGFRQ
(SEQ ID NO:33)


KIAA0973 


1 


3 


AA245 


a2C-AR (modified) (adrenergic receptor
alpha-2C)


DFRPSFKHILFRRARRGFRQ
(SEQ ID NO:33)


CARD 14 


1 


5 


AA245 


a2C-AR (modified) (adrenergic receptor
alpha-2C)


DFRPSFKHILFRRARRGFRQ
(SEQ ID NO:33)


DVL2 


1 


3 


AA252 


ACM3 (muscarinic acetylcholine
receptor M3)


QQYQQRQSVIFHKRAPEQAL
(SEQ ID NO:34)


APXL1 


1 


1 


AA252 


ACM3 (muscarinic acetylcholine
receptor M3)


QQYQQRQSVIFHKRAPEQAL
(SEQ ID NO:34)


KIAA0807 


1 




AA252 


ACM3 (muscarinic acetylcholine 
receptor M3) 


QQYQQRQSVIFHKRAPEQAL 
(SEQ ID NO:34) 


KIAA0807 






AA252 


ACM3 (muscarinic acetylcholine 
receptor M3) 


QQYQQRQSVIFHKRAPEQAL 
(SEQ ID NO:34) 


AIPC 






AA181 


BAI-1 (brain-specific angiogenesis 
inhibitor 1) 


RSGATIPLVGQDIIDLQTEV 
(SEQ ID NO:35) 


TIP1 






AA181 


BAI-1 (brain-specific angiogenesis 
inhibitor 1) 


RSGATIPLVGQDIIDLQTEV 
(SEQ ID NO:35) 


KIAA1526 










AA181 


BAI-1 (brain-specific angiogenesis
inhibitor 1)


RSGATIPLVGQDIIDLQTEV
(SEQ ID NO:35)


PSD95 


2 


1 


AA181 


BAI-1 (brain-specific angiogenesis
inhibitor 1)


RSGATIPLVGQDIIDLQTEV
(SEQ ID NO:35)


TIP 43 


1 




AA181 


BAI-1 (brain-specific angiogenesis
inhibitor 1)


RSGATIPLVGQDIIDLQTEV
(SEQ ID NO:35)


NeDLG 


1,2 




AA181 


BAI-1 (brain-specific angiogenesis
inhibitor 1)


RSGATIPLVGQDIIDLQTEV
(SEQ ID NO:35)


KIAA0973 


1 


1 


AA181 


BAI-1 (brain-specific angiogenesis
inhibitor 1)


RSGATIPLVGQDIIDLQTEV
(SEQ ID NO:35)


INADL 


3 


1 


AA181 


BAI-1 (brain-specific angiogenesis
inhibitor 1)


RSGATIPLVGQDIIDLQTEV
(SEQ ID NO:35)


DLG2 


2 


1 


AA45 


BLR-1 (Burkitt's lymphoma receptor 1) 


PSWRRSSLSESENATSLTTF
(SEQ ID NO:36)


KIAA0561 


1 


1 


AA45 


BLR-1 (Burkitt's lymphoma receptor 1) 


PSWRRSSLSESENATSLTTF
(SEQ ID NO:36)


PDZK-1 


2 


1 


AA45 


BLR-1 (Burkitt's lymphoma receptor 1) 


PSWRRSSLSESENATSLTTF
(SEQ ID NO:36)


KIAA0807 


1 




AA45 


BLR-1 (Burkitt's lymphoma receptor 1) 


PSWRRSSLSESENATSLTTF
(SEQ ID NO:36)


PDZK1 


2,3,4 


1 


AA45 


BLR-1 (Burkitt's lymphoma receptor 1) 


PSWRRSSLSESENATSLTTF
(SEQ ID NO:36)


SHANK 


1 


1 


AA45 


BLR-1 (Burkitt's lymphoma receptor 1) 


PSWRRSSLSESENATSLTTF
(SEQ ID NO:36)


KIAA0807 


1 


2 


AA269 


C5AR (C5a anaphylatoxin chemotactic
receptor)


ESKSFTRSTVDTMAQKTQAV
(SEQ ID NO:37)


PTPL-1 


4 


1 


AA29.2 


IL8RB (Interleukin-8 receptor B)


KDSRPSFVGSSSGHTSTTL
(SEQ ID NO:38)


KIAA0807 


1 


5 


AA29.2 


IL8RB (Interleukin-8 receptor B)


KDSRPSFVGSSSGHTSTTL
(SEQ ID NO:38)


SHANK 


1 


3 


AA29.2 


IL8RB (Interleukin-8 receptor B)


KDSRPSFVGSSSGHTSTTL
(SEQ ID NO:38)


KIAA0382 


1 


2 


AA29.2 


IL8RB (Interleukin-8 receptor B)


KDSRPSFVGSSSGHTSTTL
(SEQ ID NO:38)


KIAA0807 


1 


5 


AA215 


CKR5 (CC Chemokine receptor type 5) 


ERASSVYTRSTGEQEISVGL
(SEQ ID NO:39)


KIAA1719 


5 


1 


AA215 


CKR5 (CC Chemokine receptor type 5) 


ERASSVYTRSTGEQEISVGL
(SEQ ID NO:39)


KIAA1719 


2 


1 


AA215 


CKR5 (CC Chemokine receptor type 5) 


ERASSVYTRSTGEQEISVGL
(SEQ ID NO:39)


TAX IP2 


1 


1 


AA215 


CKR5 (CC Chemokine receptor type 5) 


ERASSVYTRSTGEQEISVGL
(SEQ ID NO:39)


TIP1 


1 


1 


AA215 


CKR5 (CC Chemokine receptor type 5) 


ERASSVYTRSTGEQEISVGL
(SEQ ID NO:39)


MINT1 


1,2 


1 


AA215 


CKR5 (CC Chemokine receptor type 5) 


ERASSVYTRSTGEQEISVGL
(SEQ ID NO:39)


KIAA1634 


1 


1 


AA124 


CXCR3 (C-X-C Chemokine receptor 
type 3) 


SSSRRDSSWSETSEASYSGL
(SEQ ID NO:40)


ELFIN 1 


1 


1 


AA124 


CXCR3 (C-X-C Chemokine receptor 
type 3)


SSSRRDSSWSETSEASYSGL
(SEQ ID NO:40)


KIAA0807 


1 




AA124 


CXCR3 (C-X-C Chemokine receptor 
type 3)


SSSRRDSSWSETSEASYSGL
(SEQ ID NO:40)


KIAA0807 


1 


1 


AA114 


GLUR7 (metabotropic glutamate
receptor 7)


VDPNSPAAKKKYVSYNNLVI
(SEQ ID NO:41)


KIAA1634 


1 


1 


AA114 


GLUR7 (metabotropic glutamate
receptor 7)


VDPNSPAAKKKYVSYNNLVI
(SEQ ID NO:41)


DLG1 


2 


1 


AA114 


GLUR7 (metabotropic glutamate
receptor 7)


VDPNSPAAKKKYVSYNNLVI
(SEQ ID NO:41)


PAR3 


3 




AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL
(SEQ ID NO:42)


KIAA0807 


1 


1 


AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL
(SEQ ID NO:42)


KIAA0380 


1 




AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL 
(SEQ ID NO:42) 


KIAA1634 


1 




AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL 
(SEQ ID NO:42) 


MAGI 1 


2 




AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL 
(SEQ ID NO:42) 


PSD95 


1,2,3 




AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL 
(SEQ ID NO:42) 


MAGI 1 


6 




AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL 
(SEQ ID NO:42) 


Syntrophin 1 alpha 


1 








AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL
(SEQ ID NO:42)


KIAA1634 


5 


1 


AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL
(SEQ ID NO:42)


MUPP1 


13 


1 


AA29.3


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL
(SEQ ID NO:42)


novel PDZ gene 


2 




AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL
(SEQ ID NO:42)


PDZK1 


2,3,4 




AA29.3 


IL-8RA (Interleukin-8 receptor A)


ARHRVTSYTSSSVNVSSNL
(SEQ ID NO:42)


TIP1 


1 


1 


AA330 


P2Y1 (P2Y Purinoceptor 1)


SEDMTLNILPEFKQNGDTSL
(SEQ ID NO:43)


ELFIN 1 


1 


1 


AA330 


P2Y1 (P2Y Purinoceptor 1)


SEDMTLNILPEFKQNGDTSL
(SEQ ID NO:43)


KIAA0807 


1 


1 


AA330 


P2Y1 (P2Y Purinoceptor 1)


SEDMTLNILPEFKQNGDTSL
(SEQ ID NO:43)


Magi2 


5 


1 


AA330 


P2Y1 (P2Y Purinoceptor 1)


SEDMTLNILPEFKQNGDTSL
(SEQ ID NO:43)


KIAA0316 


1 


1 


AA330 


P2Y1 (P2Y Purinoceptor 1)


SEDMTLNILPEFKQNGDTSL
(SEQ ID NO:43)


EBP50 


1 


1 


AA330 


P2Y1 (P2Y Purinoceptor 1)


SEDMTLNILPEFKQNGDTSL
(SEQ ID NO:43)


KIAA0807 


1 


1 


AA330 


P2Y1 (P2Y Purinoceptor 1)


SEDMTLNILPEFKQNGDTSL
(SEQ ID NO:43)


APXL1 


1 


1 


AA330 


P2Y1 (P2Y Purinoceptor 1)


SEDMTLNILPEFKQNGDTSL
(SEQ ID NO:43)


PTN-4 


1 




AA330 


P2Y1 (P2Y Purinoceptor 1)


SEDMTLNILPEFKQNGDTSL
(SEQ ID NO:43)


EBP50 


2 


1 


AA268 


PTR2 (Parathyroid hormone receptor) 


RPMESNPDTEGAQGETEDVL 

(SEQ ID NO:44)


APXL1 


1 




AA268 


PTR2 (Parathyroid hormone receptor) 


RPMESNPDTEGAQGETEDVL 

(SEQ ID NO:44)


PAR3 


3 




AA205L 


5HT2C (serotonin receptor 5-hydroxytryptamine 2C)


ENLELPVNPSSVVSERISSV
(SEQ ID NO:45)


MUPP1 


10 


1 


AA205L 


5HT2C (serotonin receptor 5-hydroxytryptamine 2C)


ENLELPVNPSSVVSERISSV
(SEQ ID NO:45)


INADL 


8 


1 


AA248 


SSR4 (somatostatin receptor type 4) 


EALQPEPGRKRIPLTRTTTF 
(SEQ ID NO:46)


MAGI 1 


5 




AA248 


SSR4 (somatostatin receptor type 4) 


EALQPEPGRKRIPLTRTTTF 


MAGI 1 


4 


1 


AA248 


SSR4 (somatostatin receptor type 4) 


EALQPEPGRKRIPLTRTTTF 

(SEQ ID NO:46)


DLG1 


1,2 


1 


AA248 


SSR4 (somatostatin receptor type 4) 


EALQPEPGRKRIPLTRTTTF 
(SEQ ID NO:46)


KIAA0807 


1 


1 


AA248 


SSR4 (somatostatin receptor type 4) 


EALQPEPGRKRIPLTRTTTF 
(SEQ ID NO:46)


MINT1 


1,2 


1 


AA248 


SSR4 (somatostatin receptor type 4) 


EALQPEPGRKRIPLTRTTTF 

(SEQ ID NO:46)


PDZK1 


2,3,4 


1 


AA113 


SSTR2 (somatostatin receptor type 2) 


LNETTETQRTLLNGDLQTSI 

(SEQ ID NO:47)


KIAA0382 


1 


1 


AA113 


SSTR2 (somatostatin receptor type 2) 


LNETTETQRTLLNGDLQTSI 
(SEQ ID NO:47)


KIAA0807 


1 




AA113 


SSTR2 (somatostatin receptor type 2) 


LNETTETQRTLLNGDLQTSI 

(SEQ ID NO:47)


KIAA1526 


1 


1 


AA113 


SSTR2 (somatostatin receptor type 2) 


LNETTETQRTLLNGDLQTSI 
(SEQ ID NO:47)


KIAA1719 


6 


1 


AA113 


SSTR2 (somatostatin receptor type 2) 


LNETTETQRTLLNGDLQTSI 
(SEQ ID NO:47)


Mint1


2 




AA113 


SSTR2 (somatostatin receptor type 2) 


LNETTETQRTLLNGDLQTSI 
(SEQ ID NO:47)


SHANK 


1 


1 


AA113 


SSTR2 (somatostatin receptor type 2) 


LNETTETQRTLLNGDLQTSI 

(SEQ ID NO:47)


GRIP1 


7 


1 


AA113 


SSTR2 (somatostatin receptor type 2) 


LNETTETQRTLLNGDLQTSI 
(SEQ ID NO:47) 


KIAA0807 


1 




AA113 


SSTR2 (somatostatin receptor type 2) 


LNETTETQRTLLNGDLQTSI 
(SEQ ID NO:47)


MINT1 


1,2 




AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 
(SEQ ID NO:48) 


SSTRIP 


1 




AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 
(SEQ ID NO:48) 


MAGI 1 


2 




AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 
(SEQ ID NO:48) 


MAGI 1 


5 
AA329


VIPS (vasoactive intestinal polypeptide
receptor 2) 


LQFHRGSRAQSFLQTETSVI 


EBP50 


1 


1 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 


FLJ00011 


1 


2 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 


INADL 


8 


1 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 

(SEQ ID NO:48)


KIAA0382 


1 


1 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 
(SEQ ID NO:48)


KIAA0807 


1 


3 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2)


LQFHRGSRAQSFLQTETSVI
(SEQ ID NO:48)


KIAA0807 


1 


3 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2)


LQFHRGSRAQSFLQTETSVI 


INADL 


3 


1 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 
(SEQ ID NO:48)


KIAA0973 


1 


1 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 

(SEQ ID NO:48)


KIAA1526 


2 


2 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 
(SEQ ID NO:48)


KIAA1526 


1 


1 


AA329 


VIPS (vasoactive intestinal polypeptide
receptor 2)


LQFHRGSRAQSFLQTETSVI
(SEQ ID NO:48)


NSP 


1 


1 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 
(SEQ ID NO:48) 


PIST 


1 


1 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 
(SEQ ID NO:48) 


Shank 1 


1 


2 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 
(SEQ ID NO:48) 


Shank 3 


1 


2 


AA329 


VIPS (vasoactive intestinal polypeptide 
receptor 2) 


LQFHRGSRAQSFLQTETSVI 
(SEQ ID NO:48) 


TIP 43 


1 


1 
Table 3.



Gene Name (Synonyms) 


Genbank 
Reference 


Last 20 aa 


Last 4 aa 


PL 

? 


AVC PL 
ID 


Adenosine A1 receptor (AdenoA1R)


S45235.1 
Gl:256154 


FRCQPAPPIDEDLPEERPDD 
(SEQ ID NO:49) 


RPDD 
(SEQ ID 
NO:50) 






Adenosine A2a receptor (AdenoA2a, 
ADORA2A) 


X68486.1 
Gl:400451 


VCPEPPGLDDPLAQDGAGVS 
(SEQ ID NO:51)


AGVS 
(SEQ ID 
NO:52) 






Adenosine A2b receptor (AdenoA2b) 


M97759.1 
GI:178149


ADVKSGNGQAGVQPALGVGL 
(SEQ ID NO:53) 


GVGL 
(SEQ ID 
NO:54) 


X 




Adenosine A3 receptor (AdenoA3R) 


AAA16365.1 
Gl:349449 


ACWCHPSDSLDTSIEKNSE 
(SEQ ID NO:55) 


KNSE

(SEQ ID 
NO:56) 






Adrenergic receptor alpha 2B (a2BadrR, 
alpha-2B adrenoceptor, subtype C2) 


M34041.1 
GI:178197


QDFRRAFRRILCRPWTQTAW 
(SEQ ID NO:57) 


QTAW 
(SEQ ID 
NO:58) 


X 


AA244 


Adrenergic receptor alpha-1A (a1AAdrR,
Alpha 1A-adrenoceptor, Alpha-1C adrenergic
receptor).


U02569.1 
Gl:409028 


HQVPTIKVHTISLSENGEEV (SEQ 
ID NO:59) 


GEEV 
(SEQ ID 
NO:60) 


X 




Adrenergic receptor Alpha-1A isoform 4
(a1AAdri4) 


AF013261.1 
Gl:2978555 


REHIKHVNFMMPPWRKGLEC 
(SEQ ID NO:61) 


GLEC 
(SEQ ID 
NO:62) 


X 




Adrenergic receptor alpha-1B (a1BAdrR,
Alpha 1B-adrenoceptor).


U03865.1 
GI:494982


DVANGQPGFKSNMPLAPGQF 
(SEQ ID NO:63) 


PGQF 
(SEQ ID 
NO:64) 


X 




Adrenergic receptor alpha-1C isoform 2
(a1CAdri2) 


D32202.1 
Gl:927208 


FLVETGFHHVGQDDLDLLTS 
(SEQ ID NO:65) 


LLTS 
(SEQ ID 
NO:66) 






Adrenergic receptor alpha-1C isoform 3
(a1CAdri3) 


D32201.1 
GI:927210


ITVSKDQSSCTTARGHTPMT 
(SEQ ID NO:67) 


TPMT 
(SEQ ID 
NO:68) 






Adrenergic receptor alpha-1D (a1DAdrR)


U03864.1 
Gl:494980 


GATCQAYELADYSNLRETDI 
(SEQ ID NO:69) 


ETDI 
(SEQ ID 
NO:70) 


X 




Adrenergic receptor alpha-2A (a2AAdrRec, 
alpha-2A adrenoceptor, subtype C10) 


M23533.1 
Gl:178195 


HDFRRAFKKILCRGDRKRIV 
(SEQ ID NO:71) 


KRIV 
(SEQ ID 
NO:72) 


X 


AA243 


Adrenergic receptor alpha-2C (a2CARC4, 
alpha-2C adrenoceptor, SubtypeC4) 


J03853.1 
Gl:178193 


DFRRSFKHILFRRRRRGFRQ 
(SEQ ID NO:73) 


GFRQ 
(SEQ ID 
NO:74) 


X 


AA245 


Adrenergic receptor Beta-1a (b1AdrRec)


J03019.1 


DSDSSLDEPCRPGFASESKV 
(SEQ ID NO:75)


ESKV
(SEQ ID
NO:76)


X 




Adrenergic receptor beta-2a (b2AdrRec)


Y00106.1 

0l,£90/ U 


VPSDNIDSQGRNCSTNDSLL 
(SEQ ID NO:77)


DSLL
(SEQ ID
NO:78)


X 




Adrenergic receptor Beta-3 (b3AdrRec) 


X72861.1 


SSPAQPRLCQRLDGASWGVS 

(SEQ ID NO:79)


WGVS 
(SEQ ID 
NO:80) 






Adrenocorticotropic hormone receptor (ACTH
receptor, ACTH-R, Melanocortin-2 receptor,
MC2-R, Adrenocorticotropin receptor)


X65633.1 


FRSPELRDAFKKMIFCSRYW 

(SEQ ID NO:81)


SRYW
(SEQ ID
NO:82)






Adrenomedullin receptor (AM-R, AdrmedR) 


Y13583.1 
Gl:2652933 


AHHLLPNTSPISPTQPLTPS (SEQ 
ID NO:83) 


LTPS 
(SEQ ID 
NO:84) 






Angiotensin II receptor type 1 (AT1, Anglltl, 
AT1AR) 


M91464.1 
Gl:179121 


RPSDNVSSSTKKPAPCFEVE 
(SEQ ID NO:85) 


FEVE 
(SEQ ID 
NO:86) 






Angiotensin II receptor type-1B (AngII1B,
AT1B, AT1BR)


D13814.1 

GI:471120


RPSDNVSSSTKKPAPCFEVE 

(SEQ ID NO:87)


FEVE 
(SEQ ID
NO:88) 






Angiotensin II receptor type-2 (AngII2R, AT2)


U20860.1 
GI:747969


RESMSCRKSSSLREMETFVS 
(SEQ ID NO:89) 


TFVS 
(SEQ ID 
NO:90) 






Apelin receptor (G protein-coupled receptor 
APJ, Angiotensin receptor-like 1, HG11)


U03642.1 
GI:425351


GGEQMHEKSIPYSQETLVVD
(SEQ ID NO:91)


LVVD
(SEQ ID 
NO:92) 






Blue-sensitive opsin (Blue cone photoreceptor 
pigment, Bluopsin) 


M13299.1 
Gl:1469901 


TCSSQKTEVSTVSSTQVGPN 
(SEQ ID NO:93) 


VGPN 
(SEQ ID 
NO:94) 
Bombesin receptor subtype-3 (BRS-3)


1 AQOOO A 

L0oo93.1 
Gl:291876 


bblbv I or I oL»b VfvJAbUKr 

(SEQ ID NO:95) 


tUKr 
(SEQ ID 
NO:96) 


X 




Bradykinin receptor B1 (B1 brady R, BK-1 
receptor, B1 R) 


1 lOO'J/IC A 

Gl:727358 


CI ADIQQCUDk^PIPni F\A/PM 

oLArlooonKftfclrULrVVKiN 
(SEQ ID NO:97) 


(SEQ ID 
NO:98) 






Bradykinin receptor B2 (B2BK2R, BK-2 
receptor, B2R) 


vac a cc a 

Gl:1220160 


I blbvbKUIHKLtJUVVAGbKVJ 

(SEQ ID NO:99) 


(SEQ ID 
NO:100) 


X 




Brain-specific angiogenesis inhibitor 1 
precursor (BAI1pre)


AB005297.1

Gl:2653431 


ncr> ati di \ tc*or\\ mi atc\/ /ccn 
Koon I IHLVoUUIIULU 1 1 V (otU 

ID NO:101) 


OTP\/ 

(SEQ ID 
NO:102) 


X 


AA I O \ 


Brain-specific angiogenesis inhibitor 2 
precursor (BAI2pre, BAI2) 


AB005298.1
GI:3021698


HRAAAWbr I bHPDGUrU I LV 
(SEQ ID NO:103) 


U I b V 
(SEQ ID 
NO: 104) 


X 




Brain-specific angiogenesis inhibitor 3 
precursor (BAI3pre, BAI3, KIAA0550) 


AB005299.1
Gl:3021700 


lA/ci/fi mi di n\/oconcoTC\/ 
(SEQ ID NO:105) 


OTP\/ 
KJ I b V 

(SEQ ID 
NO:106) 


X 




C3a anaphylatoxin chemotactic receptor 
(C3achemR, C3AR) 


1 IOQ/t QQ A 

U2o4oo.1 
Gl:1 199577 


TDCTUPDCMM\/ICCDMCTT\/ 

1 Kb I HorbNNVIbbKINb 1 1 V 

(SEQ ID NO:107) 


b I I V 

(SEQ ID 
NO:108) 


X 




C5a anaphylatoxin chemotactic receptor 
(C5a-R, CD88, CD88 antigen) 


VCQC7/ A 

Xoob/4.1 
Gl:29568 


cci^ccTDCTwnTMAOkTOAV 
tor\or 1 Kb 1 VU 1 MAUK 1 Unv 

(SEQ ID NO:109) 


TOA\/ 
I LiAV 

(SEQ ID 
NO:110) 


X 


A AOCQ 


Calcitonin gene-related peptide type 1 
receptor precursor (CGRPIRpre, CGRP type 
1 receptor, CALCRL, CGRPR, CGRR). 


1 "7CQQ/\ A 

L7boo0.1 
Gl:1321593 


MOL^CILJOICMX/l 1 l/nrM| VM /CCH 

NGKblnUlbN VLLI\rbNI_YN (btU 
ID NO:111)


Ml VM 
INL YIN 

(SEQ ID 
NO:112) 






Calcitonin receptor precursor (CalcRpre, CT- 
R, CALCR, CALR). 


I nnc o ~7 a 
L005o7.1 

Gl:179879 


nPCCCACIIDI klllCOeCCA /ceo 

UGbbbAbllrLNIIbUbbbA (otU 
ID NO:113) 


bbbA 
(SEQ ID 
NO:114) 


X 




Calcium-mobilizing lysophosphatidic acid 
receptor LP-A3/EDG-7 (EDG7, EDG7#2) 


A C A OOQDn A 

Gl:6003655 


obUYIbUblbUoAVUNKb I b 
(SEQ ID NO:115) 


r\b l b 
(SEQ ID 
NO:116) 






Cannabinoid receptor 1 (CB1, CB-R, CANN6) 


X54937.1 
GI:29914


TVKIAKVTMSVSTDTSAEAL 

(SEQ ID NO:117)


AEAL
(SEQ ID
NO:118)


X 




Cannabinoid receptor 2 (CB2, CB-2, CX5) 


X74328.1 
GL4U7oUo 


EADGKITPWPDSRDLDLSDC 
(SEQ ID NO:119)


LSDC
(SEQ ID
NO:120)


X 




C-C Chemokine binding protein 2 
(Chemokine-binding protein D6, C-C 
chemokine receptor D6, Chemokine receptor 
CCR-9, CC-Chemokine receptor CCR10) 


U94888.1 
GI:2213808


LGERQSENYPNKEDVGNKSA 

(SEQ ID NO:121)


NKSA
(SEQ ID
NO:122)






C-C nhpmnleini* rpnpntnr R ^CClRfi^ 


AAB57794.1
Gl:2104521 


PRGQSAQGTSREEPDHSTEV 
(SEQ ID NO:123) 


STEV 
(SEQ ID 
NO:124) 


X




C-C chemokine receptor 9A (CCR9A)


AJ132337.1
Gl:4886431 


EGSLKLSSMLLETTSGALSL 
(SEQ ID NO:125) 


ALSL 
(SEQ ID 
NO: 126) 


X




C-C chemokine receptor type 1 (MIP1aR, C-C
CKR-1, CCR-1, MIP-1alpha-R, RANTES-R, 
HM145, LD78 receptor) 


L09230.1
GI:179984


LERVSSTSPSTGEHELSAGF 
(SEQ ID NO:127)


SAGF 
(SEQ ID 
NO: 128) 






C-C chemokine receptor type 10 (C-C CKR-10,
CCR-10, G-protein coupled receptor 2)


Gl:7546844 


RPRI ^P^APTFTHSl ^WDN 
(SEQ ID NO:129)


SWDN 
(SEQ ID 
NO: 130) 






C-C chemokine receptor type 11 (C-C CKR-11,
CCR-11, Chemokine receptor-like 1,
CCRL1, CCX CKR)


GI:7363341


VccepenQFnPTFPT^TF^I iSFO 

ID NO:131) 


TFSI 
(SEQ ID 
NO: 132) 


X 




C-C chemokine receptor type 2 (C-C CKR-2,
CCR-2, Monocyte chemoattractant protein 1 
receptor, MCP-1-R) 


UUOOOZ.. I 

Gl:472555 


ftKf^k'Cj(5RAPFA c ;! ODKFfiA 

(SEQ ID NO:133) 


KEG A 
(SEQ ID 
NO: 134) 


X 




C-C chemokine receptor type 3 (C-C CKR-3, 
CCR-3, CKR3, Eosinophil eotaxin receptor) 


U2od94.1 
Gl: 1199579 


1 CDTCC\/CDCTACDCI Cl\/C /CCO 

LbK 1 bbvbrb 1 AbKbLblvr (bbU 
ID NO:135) 


bl Vr 

(SEQ ID 
NO: 136) 


X 


AA40 


C-C chemokine receptor type 4 (C-C CKR-4, 
CCR-4, K5-5) 


AB023888.1 
Gl:6467134 


DTPSSSYTQSTMDHDLHDAL 
(SEQ ID NO: 137) 


HDAL 
(SEQ ID 
NO:138) 


X




CC Chemokine receptor type 5 (C-C CKR5, 
CCR5, HIV-1 fusion co-receptor, CHEMR13, 
CD195 antigen) 


X91492.1 
GI:1262810


ERASSVYTRSTGEQEISVGL 
(SEQ ID NO:139) 


SVGL 
(SEQ ID 
NO:140) 


X 


AA215 


C-C chemokine receptor type 6 (CC-CKR-6, 
CCR-6, LARC receptor, GPRCY4, Chemokine 
receptor-like 3, CKR-L3, DRY6) 


U45984.1 
Gl:2246432 


NISRQTSETADNDNASSFTM 
(SEQ ID NO:141)


SFTM 
(SEQ ID 
NO:142) 
C-C chemokine receptor type 7 (CC-CKR-7,
CCR-7, MIP-3 beta receptor, EBV-induced G
protein-coupled receptor 1, EBI1, BLR2) 


L08176.1 
G 1 . 1 o34o4 


RHIRRSSMSVEAETTTTFSP
(SEQ ID NO:143)


TFSP
(SEQ ID
NO:144)






C-C chemokine receptor type 8 (CC-CKR-8, 
CCR-8, GPR-CY6, Chemokine receptor-like 1, 
CKR-L1, TER1, CMKBRL2, CC- chemokine 
receptor CHEMR1) 


U45983.1 
GI:2231165


EKSSSCQQHSSRSSSVDYIL 
(SEQ ID NO:145)


DYIL
(SEQ ID
NO:146)


X 




Cell surface glycoprotein EMR1 precursor 
(EMR1pre, EMR1 hormone receptor).


X81479.1 

GI:784993


SQSQTSRILLSSMPSASKTG 
(SEQ ID NO:147)


SKTG
(SEQ ID
NO:148)






Chemokine receptor-like 1 (G-protein coupled 
receptor DEZ, G protein- coupled receptor 
ChemR23) 


U79526.1 
GI:1732342


TKMSSMNERTSMNERETGML 

(SEQ ID NO:149)


TGML
(SEQ ID
NO:150)






Chemokine receptor-like 2 (IL8-related 
receptor DRY12, Flow-induced endothelial G 
protein-coupled receptor, FEG-1, G protein- 
coupled receptor GPR30, GPCR-BR) 


Y08162.1 
Gl:1707499 


LKAVIPDSTEQSDVRFSSAV 
(SEQ ID NO:151) 


SSAV 

(SEQ ID
NO:152)


X 




Cholecystokinin type A receptor (CCK-A 
receptor, CCKAR) 


L13605.1
Gl:306490 


TGASLSRFSYSHMSASVPPQ 
(SEQ ID NO:153) 


VPPQ 
(SEQ ID 
NO:154) 






Corticotropin releasing factor receptor 1 
precursor (CRFRIpre, CRFR, CRF1, CRHR1, 
CRHR) 


L23333.1 
Gl:408691 


SIPTSPTRVSFHSIKQSTAV (SEQ 
ID NO: 155) 


STAV 
(SEQ ID 
NO:156) 


X




Corticotropin releasing factor receptor 2 
precursor (CRFR 2, CRF2, Corticotropin- 
releasing hormone receptor 2, CRHR 2) 


U34587.1 
GI:1144507


SIPTSPTRISFHSIKQTAAV (SEQ 
ID NO: 157) 


TAAV 
(SEQ ID 
NO: 158) 


X




CX3C chemokine receptor 1 (CX3CR1, 
Fractalkine receptor, GPR13, V28, Beta 
chemokine receptor-like 1, CMKBLR1)


U20350.1 
Gl:665580 


SVLSSNFTYHTSDGDALLLL 
(SEQ ID NO:159) 


LLLL 
(SEQ ID 
NO:160) 


X 




C-X-C Chemokine receptor type 3 (CXCR-3, 
CKR-L2, CD183 antigen). 


X95876.1 
Gl:1552845 


SSSRRDSSWSETSEASYSGL 
(SEQ ID NO:161) 


YSGL 
(SEQ ID 
NO:162) 


X 


AA124 


C-X-C chemokine receptor type 4 (CXC-R4, 
Stromal cell- derived factor 1 receptor, SDF-1 
receptor, Fusin, Leukocyte-derived seven 
transmembrane domain receptor, LESTR, 
LCR1, FB22, NPYRL, HM89, CD184 antigen) 


L01639.1 
Gl:189313 


KRGGHSSVSTESESSSFHSS 
(SEQ ID NO:163) 


FHSS 
(SEQ ID
NO: 164) 






C-X-C chemokine receptor type 5 (CXCR-5, 
Burkitt's lymphoma receptor 1, BLR1,
Monocyte-derived receptor 15, MDR15) 


X68149.1 
Gl:29459 


PSWRRSSLSESENATSLTTF 
(SEQ ID NO:165) 


LTTF 
(SEQ ID 
NO: 166) 


X


AA45 


C-X-C chemokine receptor type 6 (CXCR-6, G
protein-coupled receptor bonzo, G protein- 
coupled receptor STRL33) 


AF007545.1 
GI:2253421


DNSKTFSASHNVEATSMFQL 
(SEQ ID NO:167) 


MFQL 
(SEQ ID 
NO:168) 


X




Dopamine receptor 1A (DopRec1A, DRD1)


X55758.1 
Gl:288931 


DTDVSLEKIQPITQNGQHPT
(SEQ ID NO:169)


QHPT
(SEQ ID 
NO:170) 






Dopamine receptor D2 (DRD2) 


M30625.1 
Gl:181431 


IIYTTFNIEFRKAFLKILHC (SEQ
ID NO:171)


ILHC
(SEQ ID 
NO:172) 


X 




Dopamine receptor D3 (DopRecD3)


U32499.1
Gl:927341 


VIYTTFNIEFRKAFLKILSC (SEQ 
ID NO:173) 


ILSC 
(SEQ ID 
NO:174) 


X




Dopamine receptor D4 (D(2C) dopamine 
receptor) 


AAB59386.1 
Gl:291946 


YTVFNAEFRNVFRKALRACC 
(SEQ ID NO:175) 


RACC 
(SEQ ID 
NO:176) 


X




Dopamine receptor D5 (DopRec1B-D, DRD5, 
D-1B dopamine receptor, D5 dopamine 
receptor, D1beta dopamine receptor)


X58454.1 
GI:32048


DCEGEISLDKITPFTPNGFH (SEQ
ID NO:177)


NGFH 
(SEQ ID 
NO: 178) 






EGF-like module EMR2 (EMRZegf, EMR2) 


AF114491.1
Gl:6650688 


EMHTLSSSAKADTSKPSTVN
(SEQ ID NO:179)


STVN
(SEQ ID 
NO: 180) 






EGF-like module-containing mucin-like 
receptor EMR3 (EMR3) 


AF239764.1 
Gl:13183148 


GPDSKPSEGDVFPGQVKRKY 
(SEQ ID NO:181) 


KRKY 
(SEQ ID
NO:182)






Endothelial differentiation protein 1 (EDG-1, G 
protein-coupled sphingolipid receptor) 


M31210.1 
Gl:181948 


KDEGDNPETIMSSGNVNSSS 
(SEQ ID NO:183) 


NSSS 
(SEQ ID 
NO: 184) 






Endothelial differentiation protein 4 
(Lysophosphatidic acid G protein-coupled 
receptor 4, Endothelial differentiation 
lysophosphatidic acid G-protein-coupled 
receptor 4, EDG4) 


AF233092.1 
Gl:7243675 


GASTRIMLPENGHPLMDSTL 
(SEQ ID NO:185) 


DSTL 
(SEQ ID 
NO: 186) 


X 
Endothelial differentiation protein 5
(Lysosphingolipid receptor EDG5) 


ArUj4/oU.l 
GI:4090955


LEKLiMMIVlr 1 brTrLEGNTvV 
(SEQ ID NO:187) 


MT\ A/ 
N I VV 

(SEQ ID 
NO: 188) 


X 




Endothelin B receptor precursor (ET-B, 
Endothelin receptor Non-selective type) 


wif **yz i .i 
Gl:182275 


CI/AMnurvnMCDCCMI/VCCC 

r l\AIN Uno Y UiNh KoolN l\ Yooo 

(SEQ ID NO:189) 


voce 
Ybbb 

(SEQ ID 
NO:190) 






Endothelin B receptor-like protein-2 precursor 
(EndoBRp2, ETBR-LP-2, ETBRLP2) 


Y16280.1 
Gl:3059117 


SIYFHKPRESPPLLPLGTPC
(SEQ ID NO:191)


GTPC
(SEQ ID 
NO:192) 


X 




Endothelin-1 receptor precursor (ET-A, 
End1Rpre).


D90348.1 
Gl:219649 


KNHDQNNHNTDRSSHKDSMN 
(SEQ ID NO:193)


DSMN 
(SEQ ID 
NO: 194) 






ETL protein (EGF-TM7-latrophilin-related 
protein, ETL) 


AF192403.1
GI:11225482


IQEEYYRLFKNVPCCFGCLR 
(SEQ ID NO:195) 


GCLR 
(SEQ ID 
NO: 196) 






Extracellular calcium-sensing receptor 
precursor (CalRec, CASR, Parathyroid Cell 
calcium-sensing receptor, GPRC2A, PCAR1). 


X81086.1 
Gl:599819 


SQSFVISGGGSTVTENVVNS
(SEQ ID NO:197)


VVNS
(SEQ ID 
NO: 198) 






fMet-Leu-Phe receptor (fMLP receptor, N- 
formyl peptide receptor, FPR, N-formylpeptide 
chemoattractant receptor) 


M37128.1 
Gl:189183 


TSDTATNSTLPSAEVELQAK 
(SEQ ID NO:199)


LQAK 
(SEQ ID 
NO:200) 






FMLP-related receptor 1 (FMLP-R-I, Lipoxin 
A4 receptor, LXA4 receptor, RFP, HM63) 


M76672.1
Gl:182666 


TNDTAANSASPPAETELQAM
(SEQ ID NO:201)


LQAM
(SEQ ID 
NO:202) 






FMLP-related receptor II (FMLP-R-II, 
FMPLreIR) 


M76673.1 
Gl:182668 


TSNTHTTSASPPEETELQAM
(SEQ ID NO:203)


LQAM
(SEQ ID 
NO:204) 






Follicle stimulating hormone receptor 
precursor (FSH-R, Follitropin receptor) 


M65085.1
GI:182770


PRVTNGSTYILVPLSHLAQN
(SEQ ID NO:205)


LAQN
(SEQ ID 
NO:206) 






Frizzled 1 precursor (Fzd1pre, Frizzled-1,
Fz-1, hFz1, FzE1, FZD1).


AF072872.1 
Gl:5305406 


NSWRKFYTRLTNSKQGETTV
(SEQ ID NO:207)


ETTV
(SEQ ID 
NO:208) 


X 




Frizzled 10 precursor (Fzd10pre, Frizzled-10,
Fz-10, hFz10, FzE7, FZD10) 


AB027464.1 
Gl:5834487 


HPQKTHHGKYEIPAQSPTCV
(SEQ ID NO:209) 


PTCV 
(SEQ ID 
NO:210) 


X 




Frizzled 2 precursor (Fzd2pre, Frizzled-2, Fz- 
2, hFz2, FzE2, FZD2). 


L37882.1
GI:736678


HSWRKFYTRLTNSRHGETTV
(SEQ ID NO:211)


ETTV
(SEQ ID 
NO:212) 


X 




Frizzled 3 precursor (Fzd3pre, Frizzled-3, Fz- 
3, hFz3, FZD3). 


AB039723.1 
Gl:7670051 


THITHGTSMNRVIEEDGTSA
(SEQ ID NO:213)


GTSA
(SEQ ID 
NO:214) 


X 




Frizzled 4 precursor (Fzd4pre, Frizzled-4, Fz- 
4, hFz4, FzE4, FZD4). 


AB032417.1 
Gl:6277265 


KREKRGNGWVKPGKGSETVV 
(SEQ ID NO:215) 


ETVV
(SEQ ID 
NO:216) 


X 




Frizzled 5 precursor (Fzd5pre, Frizzled-5, Fz-
5, hFz5, FzE5, FZD5, HFZ5). 


U43318.1
Gl:1151251 


RTGPPGPAATYHKQVSLSHV
(SEQ ID NO:217) 


LSHV 
(SEQ ID 
NO:218) 


X 




Frizzled 6 precursor (Fzd6pre, Frizzled-6, Fz- 
6, hFz6, FZD6).


Aboi^yn .i 
Gl:3062802 


LVHrVboVKKbUovjvjUMbU 1 
(SEQ ID NO:219) 


LIGHT 

MbU I 
(SEQ ID 
NO:220) 






Frizzled 8 precursor (Fzd8pre, Frizzled-8, Fz- 
8, hFz8, FZD8). 


AB04o703.1 
GI:13623798


WRSGTASSVSYPKQMPLSQV
(SEQ ID NO:221)


LSQV

(SEQ ID 
NO:222) 


X 




Frizzled 9 precursor (Fzd9pre, Frizzled-9, Fz- 
9, hFz9, FzE6, FZD9).


U82169.1
Gl:1906597 


P I VVLHM I K I DPbLbNr I ML 
(SEQ ID NO:223) 


□TU 1 

r 1 ML 

(SEQ ID 
NO:224) 


X 




Galanin receptor type 1 (GAL1-R, GALK1) 


L34oJy.1 
Gl:559047 


D I KENKbRlU I Prb I NO I HV 
(SEQ ID NO:225) 


U 1 MV 

(SEQ ID 
NO:226) 


X 




Galanin receptor type 2 (GAL2-R, GALR2) 


AF040630.1 


PGPSWQGPKAGDSILTVDVA 

(SEQ ID NO:227)


VDVA
(SEQ ID
NO:228)


X 




Galanin receptor type 3 (GAL3-R, GALR3). 


AF073799.1 
GI:3608409


QGPEPREGPVHGGEAARGPE 
(SEQ ID NO:229) 


RGPE 
(SEQ ID 
NO:230) 






Gamma-aminobutyric acid type B receptor, 
subunit 1 precursor (GABAB1 pre, GABA-B- 
R1, Gb1, GABBR1, GABA-B receptor 1, 
GBR1) 


AJ225028.1 
GI:3892593


PPEPPDRLSCDGSRVHLLYK 
(SEQ ID NO:231)


LLYK 
(SEQ ID 
NO:232) 
Gamma-aminobutyric acid type B receptor,
subunit 2 precursor (GABABR2p, GABA-B 
receptor 2, GBR2, GABBR2, GABA-B-R2, 
Gb2, G protein-coupled receptor 51, GPR51,
HG20). 


AJ012188.1
Gl:3776097 


TASPRHRHVPPSFRVMVSGL 
(SEQ ID NO:233) 


VSGL 
(SEQ ID 
NO:234) 


X 




Gastric inhibitory polypeptide receptor 
precursor (GIPRpre, GIPR, Glucose-
dependent insulinotropic polypeptide
receptor). 


U39231.1 
ol. lUbbuoU 


SSGTLPGPGNEASRELESYC 

(SEQ ID NO:235)


ESYC
(SEQ ID
NO:236)


X 




Gastrin/cholecystokinin type B receptor (CCK- 
B receptor, CCK-BR) 


i no *. a\ o a\ 

Gl:306488 


DCIACI CDI CVTTICTI pnp /OCH 

KbIAbLbKLbY 1 1 lb i Lorb (otU 

ID NO:237) 


Lorb 

(SEQ ID 
NO:238) 






Gastrin-releasing peptide receptor (GRP-R, 
GRP-preferring bombesin receptor) 


M73481.1
GI:183649


NPbVA 1 rbLlNGNIOHbKYV 
(SEQ ID NO:239) 


bKYV 
(SEQ ID 
NO:240) 


X 




GHRH receptor splice variant 1 (GHRHRsp1)


AF282259.1
Gl: 10242291 


TRAKWTTPSRSAAKVLTSMC
(SEQ ID NO:241)


TSMC

(SEQ ID 
NO:242) 


X 




Glucagon receptor precursor (GlucagRp, GL-
R, GCGR) 


U03469.1

Gl:439689 


UbbAb I rLA(jL>LrKL>\torr 
(SEQ ID NO:243) 


code: 
bbrr 

(SEQ ID 

NO:244) 


X 




Glucagon-like peptide 1 receptor precursor 
(GLP1 Rpre, GLP-1 receptor, GLP-1-R) 


1 Irt A A f\A A 

Gl:405081 


bSGAl AGbbM Y l A TOUAbUb 
(SEQ ID NO:245) 


AbUb 
(SEQ ID 
NO:246) 






Glucagon-like peptide 2 receptor precursor 
(GLP2Rpre, GLP-2 receptor, GLP-2-R, 
GLP2R). 


Ar 1 UOOO ( . 1 

GU324490 


otbUV 1 MAN \ MbblLbbbbl 

(SEQ ID NO:247) 


bbbl 
(SEQ ID 
NO:248) 


X 




Gonadotropin-releasing hormone receptor 
(GNRH-R) 


L03380.1
GI:183421


FLFAFLNPCFDPLIYGYFSL (SEQ
ID NO:249)


YFSL
(SEQ ID 
NO:250) 


X 




G-protein coupled receptor 91 (GPCR91) 


Ar34oU/o.l 
Gl:13517982 


r\bl_ 1 brbKWAHbLLLbrKbK. 
(SEQ ID NO:251) 


rnri/ 

rKbfN. 

(SEQ ID 
NO:252) 






G-protein coupled receptor EDG-7 (EDG7) 


Ah23b 117.1 
Gl:9651838 


C^bUYKbUbbbUvj I VUNKNbb 
(SEQ ID NO:253) 


l/MOC 

r\Nbb 
(SEQ ID 
NO:254) 






G-protein-coupled receptor 74 (GPCR74) 


AF236083.1 
GM4279164 


QNPHGETLLYRKSAENPNRN 
(SEQ ID NO:255) 


PNRN 
(SEQ ID 
NO:256) 






Green-sensitive opsin (Green cone 
photoreceptor pigment) 


M13306.1 
Gl: 180688 


SELSSASKTEVSSVSSVSPA 
(SEQ ID NO:257) 


VSPA 
(SEQ ID 
NO:258) 


V 




Growth hormone secretagogue receptor type 
1 (GHS-R, GH-releasing peptide receptor, 
GHRP, Ghrelin receptor) 


U60179.1 
GI:1504140


KLSTLKDESSRAWTESSINT
(SEQ ID NO:259) 


SINT 
(SEQ ID 
NO:260) 






Growth hormone-releasing hormone receptor 
precursor (GHRHRpre, GRF receptor, GRFR).


L01406.1
GI:183172


TRAKWTTPSRSAAKVLTSMC
(SEQ ID NO:261)


TSMC
(SEQ ID 
NO:262) 


X 




Histamine H1 receptor (HistH1R)


Z34897.1 
Gl:510295 


YPLCNENFKKTFKRILHIRS (SEQ
ID NO:263)


HIRS
(SEQ ID 
NO:264) 






Histamine H2 receptor (HistH2R, H2R, Gastric
receptor 1) 


MC47AA 4 

M04799.1 
Gl:184087 


I rWAA/O^TCVrTADrtAATnD 

LKLUWVbo 1 b V I ArUbA I UK 
(SEQ ID NO:265) 


A 1 UK 
(SEQ ID 
NO:266) 






Histamine H3 receptor (HH3R, G protein-
coupled receptor 97) 


AF140538.1
Gl:5031290 


I 1 rDHI/l 1/IADUCCI CUir*VA/l/^ 

LLOrUKLKIUrHbbLbHOWK 
(SEQ ID NO:267) 


HL/Wft 
(SEQ ID 
NO:268) 






Histamine H4 receptor (HH4R, GPRv53, G
protein-coupled receptor 105, GPCR105, 
SP9144, AXOR35) 


AbU44y34.1 

Gl:10241846 


r\lrL»lf\AUr LroUnonoVoo 
(SEQ ID NO:269) 


(SEQ ID 
NO:270) 






HUKb betaio (HUKo D13) 


A AT/ 1 £77 4 

AAb41b/ / .1 

Gl:1 1908212 


ULfCMCI PTCMAI DMVI CLir\n 

nKrMbLU 1 bNALrNYLrHUU 
(SEQ ID NO:271) 


rnvju 
(SEQ ID 
NO:272) 






nwrw Dctao ^rnjrw 00; 


nnUf IDOO. I 

Gl:1 1908218 


r\ 1 rxWIWINrMLriLi 1 1 nrxlo 1 \OLU 

ID NO:273) 


(SEQ ID 
NO:274) 






HOR5'beta6 (HOR5'b6) 


AAG41682.1 
GI:11908217


KTKQIQSGILRLFSLPHSRA (SEQ 
ID NO:275) 


HSRA 
(SEQ ID 
NO:276) 


X 




HOR5'beta7 (HOR5'b7) 


AAG41681.1 
Gl:11908216 


KTKEI H RAI 1 KLLG LK KAS K (SEQ 
ID NO:277) 


KASK 
(SEQ ID 
NO:278) 










HOR5'beta8 (HOR5'b8)


AAG41680.1
Gl:11908215 


KTKblnoAIVKMLLblSKKKV 
(SEQ ID NO:279) 


DDD\/ 

(SEQ ID 
NO:280) 


X 




human TA2R, beta isoform (TA2Rbiso, 
TBXA2R) 


AAO^4o02.1 
GL3253117 


AGVQLLrrbrr 1 oKALoKKU 
(SEQ ID NO:281) 


OKI\U 

(SEQ ID 
NO:282) 






Interleukin-8 receptor A (IL8RA, high affinity
IL-8 receptor A, IL-8 receptor type 1, CXCR-1,
CDw128a) 


AAboy43b.1 
Gl:559050 


LARHRVTSYTSSSVNVSSNL
(SEQ ID NO:283)


SSNL

(SEQ ID 
NO:284) 


X 


A A9Q 1 


Interleukin-8 receptor B (IL8RB, high affinity
IL-8 receptor B, CXCR-2, GRO/MGSA 
receptor, CDw128b) 


M73969.1 
Gl:186516 


PKDSRPSFVGSSSGHTSTTL
(SEQ ID NO:285)


STTL

(SEQ ID 
NO:286) 


X 


A A OQ O 

AAzy.^ 


KlAAOo^i protein. 


AdUzuozo.1 

Gl:4240127 


rbLtbrbrUulJuUMULv 1 oL 

(SEQ ID NO:287) 


V 1 OL 

(SEQ ID 
NO:288) 


X 




Latrophllin-2 (LPrlnl, LbOl, U\\HVFnZ) 


AJ1 OlOOl .1 

Gl:4034485 


C/^rMDCfT^/DCf^r^MOl \/TCI 

toUtrtbU vKboUM'JLv 1 oL 

(SEQ ID NO:289) 


\/TQI 
V 1 OL 

(SEQ ID 
NO:290) 


X 




Lectomedin-1 alpha (LEC1alph, LEC1)


AMQ42ob.1 
Gl:5880489 


D AUI /^irM N/LJ1 CI 1 1 AOIA 

oLKAnLUDLYHLbLLLoUIA 
(SEQ ID NO:291) 


oVJIA 

(SEQ ID 
NO:292) 


X 




Lectomedin-1 beta (LEC1beta, LEC1)


AM 04938.1 
Gl:5880491 


\ f\S A CTTDTC A DVCCPTHniU 

VKAo I I K I oAKYooo I UUIH 
(SEQ ID NO:293) 


UUIH 
(SEQ ID 
NO:294) 






Lectomedin-2 (LEC2)


AF307079.1
GI:11037013


rbLtorbHUbUbUMULV I oL 
(SEQ ID NO:295) 


V 1 oL 
(SEQ ID 
NO:296) 


X 




Lectomedin-3 (Lecmed3, LEC3)


AF307080.1
GI:11037015


ir* a o c nonr vi/fur vcttc\a/ 
IbAobUUUbYKOnoTb I I hW 

(SEQ ID NO:297) 


1 1 bW 
(SEQ ID 
NO:298) 


X 




Leucocyte antigen CD97 precursor (CD97pre, 
CD97) 


X84700.1
Gl:840770 


I 1 oo 1 onNU 1 KALKAobool 
(SEQ ID NO:299) 


boUl 
(SEQ ID 
NO:300) 


X 




Leukotriene B4 receptor 2 (BLTR2, Seven 
transmembrane receptor BLTR2) 


AJ278605.1
Gl:8919627 


bRuNbDrouoMhKOorhVVUL 
(SEQIDNO:301) 


(SEQ ID 
NO:302) 






Luteinising hormone-choriogonadotropin 
receptor (Luteinizing hormone receptor) 


X84753.1 
Gl: 1225983 


LSTLHCUoTALLDK 1 KY 1 bO 
(SEQ ID NO:303) 


VTCP 

Y 1 bo 
(SEQ ID 
NO:304) 


X 




Lysophosphatidic acid receptor (EDG-2). 


U80811.1 
GI:1857424


ACPI KtLJXIl AP\/UCMnU 0\ A / 

AooLNnTll^oVHoNDnoVV 
(SEQ ID NO:305) 


l_IC\A/ 

(SEQ ID 
NO:306) 


X 




Lysosphingolipid receptor (EDG-3). 


VQOOC/1 4 

A00004.1 

GL1770395 


DrooOIMUKNAALUNolrLrN 
(SEQ ID NO:307) 


(SEQ ID 
NO:308) 






Melanocortin-3 receptor (MC3-R) 


L06155.1
Gl:188673 


LbLRNTrRblLCoUNoMNLo 
(SEQ ID NO:309) 


MNLo 
(SEQ ID 
NO:310) 






Melanocortin-4 receptor (MC4-R)


L08603.1

Gl:291977 


FKEIICCYPLGGLCDLSSRY (SEQ 
ID NO:311)


SSRY 
(SEQ ID 
NO:312) 


X




Melanocortin-5 receptor (MC5-R, MC-2)


Z25470.1 
Gl:939924 


FKEIICCRGFRIACSFPRRD (SEQ
ID NO:313)


PRRD 
(SEQ ID 
NO:314) 






Melanocyte stimulating hormone receptor

(MSH-R, Melanotropin receptor, Melanocortin- 
1 receptor, MC1-R) 


X65634.1
Gl:34790 


YAFHSQELRRTLKEVLTCSW 
(SEQ ID NO:315) 


TCSW 
(SEQ ID 
NO:316) 






Melatonin receptor type 1A (Mel-1A-R)


U141U0.1 

Gl:602129 


V/I/\A/I/DCDI H^TMMM\ A/L^X/RCW 

VKWr\rorLlvl i INIMIN V Vf\VUoV 

(SEQ ID NO:317) 


\/nc\/ 

(SEQ ID 
NO:318) 


X 




Melatonin receptor type 1B (Mel-1B-R)


I ncj A 4 4 

Gl:971193 


c^i noDADDiirv/nunAnAi 
bo LUbr Arr I lo VUnvJAUAL 

(SEQ ID NO:319) 


AUAL 

(SEQ ID 
NO:320) 


X 




Melatonin-related receptor (H9, GPR50)


Gl:1326154 


w n v w n\AA A/n \/f n n p n f m av 
(SEQ ID NO:321) 


EMAV 
(SEQ ID 
NO:322) 


X 




Metabotropic glutamate receptor 1 precursor 
(GluR1pre, GRM1, GPRC1A, mGluR1)


U31215.1 
Gl:945096 


PNVSYASVILRDYKQSSSTL 
(SEQ ID NO:323) 


SSTL 
(SEQ ID 
NO:324) 


X 




Metabotropic glutamate receptor 2 precursor 
(GluR2pre, MGR2, mGluR2, GRM2, 
GPRC1B) 


L35318.1 
GI:999415


QFVPTVCNGREVVDSTTSSL
(SEQ ID NO:325) 


TSSL 
(SEQ ID 
NO:326) 


X 
Metabotropic glutamate receptor 3 precursor
(GluR3pre, GRM3, GPRC1C, mGluR3) 


GI:1171563


I Y Vr 1 VOINuKtVLUo 1 1 ooL 

(SEQ ID NO:327) 


1 OoL 

(SEQ ID 
NO:328) 


X 




Metabotropic glutamate receptor 4 precursor 
(GluR4pre, mGluR4, GRM4, GPRC1D, 
MGR4) 


AoUO IO. I 

Gl:1160182 


1 PAPAI ATknTYVTYTNWAI fQPO 
LCAr ALA 1 1 T V 1 T 1 INNA1 (otU 

ID NO:329) 


MMAI 
INrirVI 

(SEQ ID 
NO:330) 






Metabotropic glutamate receptor 5 precursor 
(GluR5pre, mGluR5, GRM5, GPRC1E,
MGR5) 


Uzoooo.i 
Gl:1408051 


SSPKYDTLIIRDYTQSSSSL (SEQ
ID NO:331)


SSSL

(SEQ ID 
NO:332) 


X 




Metabotropic glutamate receptor 6 precursor 
(GluR6pre, GRM6, GPRC1F, mGluR6) 


Uo/Uoo.l 

Gl:2231437 


Lr\A 1 o 1 VAArrt\otUAtAnr\ 
(SEQ ID NO:333) 


tAnrs 
(SEQ ID 
NO:334) 






Metabotropic glutamate receptor 7 precursor 
(GluR7pre, GRM7, GPRC1G, mGluR7) 


Ay^ODZ.l 

Gl:1370110 


\/TlPMQDA AkVlO/\/QVMMI \/l 
VUrlNor AAI\r\.f\Y VoYINlNLVI 

(SEQ ID NO:335) 


Ml \/l 
INL VI 

(SEQ ID 
NO:336) 


X 


AA1 1 A 
AA I l*t 


Metabotropic glutamate receptor 8 precursor 
(GluR8pre, GRM8, GPRC1H, mGluR8) 


GI:1935042


1 CTMTCCTW T'l VIQVQMUQI fQFn 
Lt 1 IN 1 oo 1 f\ 1 1 Ylo YolNnol (OCU 

ID NO:337) 


INPlOl 

(SEQ ID 
NO:338) 






Motilin receptor (G protein-coupled receptor 
GPR38) 


ArUonooz. I 
Gl:2654158 


U 1 boU 1 VoT 1 1 1 oAlNVft. 1 Mu 

(SEQ ID NO:339) 


f\ I Mo 

(SEQ ID 
NO:340) 






Muscarinic acetylcholine receptor M1 
(AchRM1)


AOZUOO.l 

Gl:34450 


(SEQ ID NO:341) 


(SEQ ID 
NO:342) 


X 




Muscarinic acetylcholine receptor M2 


M ID4U4.1 

Gl:177989 


ri\f\ 1 rftHLLIVlUn YlMNloA 1 K 

(SEQ ID NO:343) 


oA I K 

(SEQ ID 
NO:344) 






Muscarinic acetylcholine receptor M3 
(AchRM3) 


A10ZO0.1 

Gl:32323 


nnvnoBnQ\/icui^DAPFnAi 

KJ U Y \J\J. K Uj V 1 P PI l\P\Ar LUnL 

(SEQ ID NO:345) 


PHAI 

(SEQ ID 
NO:346) 


X 


A A OCO 


Muscarinic acetylcholine receptor M4 
(AchRM4) 


M1D4U0.1 

Gl:177991 


FlfLfTFPl-ll 1 1 PfWPMIfiTAP 

(SEQ ID NO:347) 


O 1 r\f\ 

(SEQ ID 
NO:348) 






Muscarinic acetylcholine receptor M5 


IVioUooo.l 
GI:177987


D\A/lf IfWX/FFLfl V\A/PW F 2MQk'l p 
K Wf\lv\r\ V 1 1 l\L Y WUIj INo t\Lr 

(SEQ ID NO:349) 


ol\Lr 
(SEQ ID 
NO:350) 






Neuromedin K receptor (NKR, Neurokinin B 
receptor, NK-3 receptor, NK-3R) 


ooDoyz.i 
Gl:246908 


oAoA 1 oorloor Y 1 o VUfc To 

(SEQ ID NO:351) 


UtTg 

(SEQ ID 
NO:352) 






Neuromedin K receptor (NKR, Neurokinin B 
receptor, NK-4 receptor, NK-4R, K1R, 
Neurokinin 4 receptor, NK4) 


MO*K>U0. 1 

GI:189391


olol I Aor Vooonlvlo V ttbo 
(SEQ ID NO:353) 


ttoo 
(SEQ ID 
NO:354) 






Neuromedin U receptor 1 (NMUR1)


Gl:10946200 


WVnrL^oNUorLAVJUc I Uro 

(SEQ ID NO:355) 


l Uro 

(SEQ ID 
NO:356) 






Neuromedin U receptor 2 (NeUR2,
Neuromedin U receptor-type 2, G protein- 
coupled receptor TGR-1) 


ArZ/^ODO.l 
GI:10946202


ALootVJMoK I IN YUoPMPNrS, I 

(SEQ ID NO:357) 


PMkT 

(SEQ ID 
NO:358) 






Neuromedin-B receptor (NMB-R, 
Neuromedin-B-preferring bombesin receptor) 


M/o4oZ.l 

GI:189241


INMV I NoVLLNonoMftULIVlAlVl 

(SEQ ID NO:359) 


tMAIVl 

(SEQ ID 
NO:360) 






Neuropeptide FF receptor 1 (NepepFF1, RFamide-related
peptide receptor OT7T022)


AB040104.1
GI:11125701


I DDPfDr^QHI PI TIDAVA/Pll 
LrKturuOonLrL I IrAVVLM 

(SEQ ID NO:361) 


Awni 
(SEQ ID 
NO:362) 


X 




Neuropeptide FF receptor 2 (Neuropeptide G 
protein-coupled receptor, G-protein-coupled 
receptor HLWAR77) 


Ar 1 iyoi 0.1 
Gl:4530468 


l\rUUfcLVIVltt:LI\c 1 1 iNoofcl 

(SEQ ID NO:363) 


QQPI 

(SEQ ID 
NO:364) 


X 




Neuropeptide Y receptor type 1 (NepepYR1,
NPY1-R) 


IVloo40 I .1 

Gl:189155 


K r\ A Q P\/ A F WK 1 M M M n Pi M F V 1 
IxvJAo Jr. V Ar x\ f\ 1 IN IN IN U U IN t T\l 

(SEQ ID NO:365) 


(SEQ ID 
NO:366) 


X 




Neuropeptide Y receptor type 2 (NPY2-R,
NPY-Y2 receptor, NepepYR2) 


I 1 

Uoozoy.i 
Gl:1063633 


Ml FV/Pk'MQP.PMnQFTF ATM\/ 
iNLt VKISINooKINUor 1 FA 1 INV 

(SEQ ID NO:367) 


ATM\/ 
A 1 IN V 

(SEQ ID 
NO:368) 


X 




Neuropeptide Y receptor type 4 (NPY4-R,

Pancreatic polypeptide receptor 1, PP1) 


U35232.1
GI:1063629


TVHTFVSKGSL RL ^GRSNPI 
(SEQ ID NO:369) 


SNPI 
(SEQ ID 
NO:370) 






Neuropeptide Y receptor type 5 (NPY5-R, 
NPY-Y5 receptor, Y5 receptor, NPYY5) 


U56079.1 
GI:1438903


GFLNNGIKADLVSLIHCLHM 
(SEQ ID NO:371) 


CLHM 
(SEQ ID 
NO:372) 






Neurotensin receptor type 1 (NT-R-1, High-
affinity levocabastine-insensitive neurotensin
receptor, NTRH) 


X70070.1 
Gl:35020 


ADSVSSNHTLSSNATRETLY 
(SEQ ID NO:373)


ETLY 
(SEQ ID 
NO:374) 


X

Neurotensin receptor type 2 (NT-R-2,
Levocabastine-sensitive neurotensin receptor, 
NTR2 receptor) 



Y10148.1 
Gl:3901027 


UbP I LMU I Abbr GUPPb I K I 
(SEQ ID NO:375) 


CXDX 

b I R I 
(SEQ ID 
NO:376) 






Ocular albinism type 1 protein (OculAlb1,
OA1) 


Z48804.1 
Gl:886873 


AbbbUNKNbV-jUPALP I HbUL 
(SEQ ID NO:377) 


HbUL 
(SEQ ID 
NO:378) 


X 




Odorant receptor HOR3 beta 1 (HOR3 b1)


AAG42364.1
GI:11991863


oVK I UUIri 1 RMLRLrbLARY 
(SEQ ID NO:379) 


LI\KY 

(SEQ ID 
NO:380) 


X 





Odorant receptor HOR3 beta 3 (HOR3 b3) 


AAG42366.1 
GI:11991865


KIKblRNbVVLI LbRKRCjbF (SEQ
ID NO:381) 


Robr 
(SEQ ID 
NO:382) 


X 




Odorant receptor HOR3 beta 5 (HOR3 b5)


AAG42368.1 
GI:11991867


VKTKUIRDnlVKVFFFKKVT (SEQ
ID NO:383) 


KKVT

(SEQ ID 
NO:384) 






Olfactory receptor 10A4 (OlfR10A4, HP2, 
olfactory-like receptor protein JCG5) 


AF209506.1 
Gl:17016309 


KE VKAAL KRL 1 H RT LGb U KL 
(SEQ ID NO:385) 


SQKL 
(SEQ ID 
NO:386) 






Olfactory receptor 10A5 (OlfR10A5, HP3, 
Putative taste receptor JCG6) 



AAG45206.1 
Gl:12007436 



VKNALSRTFHKVLALRNCIP 
(SEQ ID NO:387) 


NCIP
(SEQ ID 
NO:388) 






Olfactory receptor 10H1 (OlfR10H1) 



AAC08454.1 
Gl:2996652 


KVAMKKTrrSKLYPbKNVMM 
(SEQ ID NO:389) 


NVMM
(SEQ ID 
NO:390) 






Olfactory receptor 10H2 (OlfR10H2)


AAC14388.1
Gl:3068559 


r\bLKVAMK.K I rLb 1 LYbbo 1 
(SEQ ID NO:391) 


COPT 

bob 1 
(SEQ ID 
NO:392) 






Olfactory receptor 10J1 (OlfR10J1, Olfactory
receptor-like protein HGMP07J) 


X64995.1
GI:32092


TLRNKbVKUALOKAVoo Krb 
(SEQ ID NO:393) 


oKrb 
(SEQ ID 
NO:394) 






Olfactory receptor 11A1 (OlfR11A1, Hs6M1-18)


AJ302614.1
GI:12054452


KbVnaALRKILCIKQTbTLIJ (SEQ
ID NO:395) 


cxi r» 
bl LU 

(SEQ ID 

NO:396) 






Olfactory receptor 12D3 (OlfR12D3, Hs6M1- 
27) 


CAB65796.1 
Gl:6691936 


MMALKKIFGRKLFKDWQQHH
(SEQ ID NO:397) 


QQHH 
(SEQ ID 
NO:398) 






Olfactory receptor 1A1 (OlfR1A1, Olfactory
receptor 17-7, OR17-7). 


AF087918.1
GI:7144622


LRNRDMKAALRKLFNKRISS
(SEQ ID NO:399) 


RISS 
(SEQ ID 
NO:400) 






Olfactory receptor 1A2 (OlfR1A2, Olfactory
receptor 17-6, OR17-6)


AF155225.1
Gl:5081803 



LRNWDMKAALUKLFbf\Rlbb 
(SEQ ID NO:401) 


RISS 
(SEQ ID 
NO:402) 






Olfactory receptor 1D2 (Olfactory receptor-like 
protein HGMP07E, Olfactory receptor 17-4, 
OR17-4) 


X65857.1 
Gl:425220 


NKDMHGALGRLLDKrlFKRL I 
(SEQ ID NO:403) 


I/O) T 

(SEQ ID 
NO:404) 






Olfactory receptor 1D4 (OlfR1D4, Olfactory
receptor 17-30, OR17-30). 


AF087922.1
Gl:7144627 


NKDMnGAPGRVLWRrFURHK 
(SEQ ID NO:405) 


URrK 

(SEQ ID 
NO:406) 






Olfactory receptor 1E1 (OlfR1E1, Olfactory
receptor-like protein HGMP07I)


X64994.1
Gl:32085 



RDMKGALbRVInUrvK 1 rrbL 
(SEQ ID NO:407) 


rrbL 
(SEQ ID 
NO:408) 


X 




Olfactory receptor 1E2 (OlfR1E2, Olfactory 
receptor 17-93/17-135, OR17-93) 


AF087925.1 
GI:7144633


RDMKGALbRVICKKKNPFLL 
(SEQ ID NO:409) 


PFLL
(SEQ ID 
NO:410) 


X 




Olfactory receptor 1F1 (OlfR1F1, Olfactory 
receptor 16-35, OR16-35). 


Y14442.1
GI:2370144


RNRYLKGALKKVVGRVVFSV
(SEQ ID NO:411)


VFSV
(SEQ ID 
NO:412) 


X 




Olfactory receptor 1G1 (OlfR1G1, Olfactory
receptor 17-209, OR17-209)


AF087928.1
Gl:7144638 



NUblKbbLRKLIWVRKInbr (SEQ
ID NO:413) 


IUCD 

Irior 

(SEQ ID 
NO:414) 






Olfactory receptor 1I1 (OlfR1I1, Olfactory
receptor 19-20, OR19-20) 


AAL*1891_.1 
Gl:3184262 


M ri r I r Y r o Vj VUb LAtj N RUM b 
(SEQ ID NO:415) 


RUMb 

(SEQ ID 
NO:416) 






Olfactory receptor 2A4 (OlfR2A4)


AAnnmQ^ 1 

nnUUO I oo. I 

Gl:4159884 


1 PWQPVk'WTI k^R\/l r^\/PRAI 
LrxINOC VrxiN 1 LW^VLUVCnnL 

(SEQ ID NO:417)


FRAI 

(SEQ ID 
NO:418) 


X 




Olfactory receptor 2AG1 (OlfR2AG1, HT3) 


Q9H205 
GI:14423804


VMRALRRVLGKYMLPAHSTL 
(SEQ ID NO:419) 


HSTL 
(SEQ ID 
NO:420) 


X 




Olfactory receptor 2B2 (OlfR2B2, Olfactory 
receptor 6-1, OR6-1, Hs6M1-10) 


AJ302584.1 
GI:12054392


CPIFVITIENYCNLPQRKFP (SEQ 
ID NO:421) 


RKFP 
(SEQ ID 
NO:422)

Olfactory receptor 2B3 (OlfR2B3, Olfactory
receptor 6-4, OR6-4, Hs6M1-1) 


CAA18782.1
Gl:3757726 


N K.U M K t AF KKL M r Kl F F Of\r\ 

(SEQ ID NO:423) 


(SEQ ID 
NO:424) 






Olfactory receptor 2B6 (OlfR2B6, Hs6M1-32,
Olfactory receptor 6-31, OR6-31).


CAC14158.1
Gl:10944516 


Ml/-[-\ /■/• C Cl/ D 1 \/AD\/CI 1 1/ [/• 

(SEQ ID NO:425) 


(SEQ ID 
NO:426) 






Olfactory receptor 2C1 (OlfR2C1, OLFmf3). 


AF098664.1 
Gl:3982606 


KNMbVKLjALKKLLLjKCjKtVU 
(SEQ ID NO:427) 


KbVLi 

(SEQ ID 
NO:428) 






Olfactory receptor 2D2 (OlfR2D2, Olfactory 
receptor 11-610, OR11-610, HB2)


AAG45204.1
GI:12007434



oLKNKUVKAALKKVAl KINFF 1 
(SEQ ID NO:429) 


nkirn 

KNrr 
(SEQ ID 
NO:430) 






Olfactory receptor 2F1 (OlfR2F1, Olfactory 
receptor-like protein OLF3).


U56421.1 
GI:1336042


KCaAWUKLLWKFoCjjL 1 bKLA 1 

(SEQ ID NO:431)


IV | a T 

(SEQ ID 
NO:432) 






Olfactory receptor 2F2 (OlfR2F2, Olfactory 
receptor 7-1, OR7-1)


AAC64378.1 
Gl:3766133 


KGAVVHKLLcKFooL 1 bKLb 1 
(SEQ ID NO:433) 


\s\ f->~T 

(SEQ ID 
NO:434) 






Olfactory receptor 2H1 (OR2H1, OlfR2H1,
Hs6M1-16, Olfactory receptor 6-2, OR6-2) 


AJ302oQ4.1 
GI:12054432



RALRRLLbKbKDbKtbWKAA 
(SEQ ID NO:435) 


WKAA 

(SEQ ID 
NO:436) 


X 




Olfactory receptor 2H3 (OlfR2H3, Olfactory 
receptor-like protein FAT11) 


L35475.1 
Gl:1041044 



FvAFKRLLGKcRUoKtbWKAA 
(SEQ ID NO:437) 



WKAA 
(SEQ ID 
NO:438) 


X 




Olfactory receptor 2J2 (Olfactory receptor 6-8, 
OR6-8, HS6M1-6) 


AJ302571.1
GI:12054366


LRNKHVKGAAKRLLCjWbVVOK 
(SEQ ID NO:439) 


bWor\ 
(SEQ ID 
NO:440) 






Olfactory receptor 2J3 (OlfR2J3, Olfactory 
receptor 6-6, OR6-6, Hs6M1-3). 


CAA18783.1
Gl:3757727 


1 YT LRN K WRG A V KRL M W b 
(SEQ ID NO:441) 


MoWb 
(SEQ ID 
NO:442) 






Olfactory receptor 2T1 (OlfR2T1, OR2T1,
Olfactory receptor 1-25, OR1-25)


XM_060310.1
GI:17437062


ALKr<ALGRFKGrUKVbooVF 
(SEQ ID NO:443) 


ooVF 
(SEQ ID 
NO:444) 


X 




Olfactory receptor 2W1 (OlfR2W1, Hs6M1-15).


CAB42853.1 
Gl:4826521 


LKKLMRFFiHKb I KIKKNOKb 
(SEQ ID NO:445) 


NoKb 

(SEQ ID 
NO:446) 






Olfactory receptor 3A1 (OlfR3A1, Olfactory
receptor 17-40, OR17-40) 


X80391.1 
Gl:516319 


RNPDVQSAIWRMLTGRRSLA
(SEQ ID NO:447) 


RSLA 
(SEQ ID 
NO:448) 


X 




Olfactory receptor 3A2 (OlfR3A2, Olfactory 
receptor 17-228, OR17-228, OR3A2,
OLFRA04) 


AF087930.1 
Gl:7144641 


RNPDVQGALWQIFLGRRSLT
(SEQ ID NO:449) 


RSLT 
(SEQ ID 
NO:450) 






Olfactory receptor 3A3 (OlfR3A3, Olfactory 
receptor 17-201, OR17-201) 


AF087926.1 
GI:7144635


RNTDVUGALoaLLVbbKbL I 
(SEQ ID NO:451) 


KbL 1 
(SEQ ID 
NO:452) 






Olfactory receptor 4F3 (OlfR4F3) 



AAD05195.1 
Gl:4159886 


EMKAAIKRVuKULVIYKKIb (SEQ
ID NO:453) 


U'DIC 

rs.Klb 
(SEQ ID 
NO:454) 






Olfactory receptor 51B2 (HOR5'b3,
HOR5'beta3, OR51B2) 


AAD29425.2 
GI:11908208


KTKUIUYGIIKLLbKMKFbK (SEQ
ID NO:455) 


DCCD 

KFbK 

(SEQ ID 
NO:456) 






Olfactory receptor 51B4 (HOR5'b1,
HOR5'beta1, 0XB4, OR51B4) 


AAD29426.2 
GI:11908209


IKTKQIQKSIIRLFSGQSRA (SEQ
ID NO:457) 


QSRA 
(SEQ ID 
NO:458) 


X 




Olfactory receptor 51E2 (OlfR51E2, Prostate
specific G-protein coupled receptor, 0XE2, 
HPRAJ, OR51E2, PSGR) 


AF311306.1
GI:11875777


R VLAM F K 1 SC D KD L UA V K 
(SEQ ID NO:459) 


(SEQ ID 
NO:460) 






Olfactory receptor 51I1 (HOR5'b11,
HOR5'beta11, OR51I1).


AAG41679.1
Gl:11908214 


SVKTKblRKGILKFFnKbUA (SEQ
ID NO:461)


KbUA 

(SEQ ID 
NO:462) 


X 




Olfactory receptor 51I2 (HOR5'b12,
HOR5'beta12, OR51I2)


AAG41678.1
Gl:11908213 


bAKIKblRRAIFKIviFHFIIr\.l (SEQ
ID NO:463) 


Fllr\l 

(SEQ ID 
NO:464) 


X 




Olfactory receptor 52A1 (HOR3'b4, HPFH1OR,
HOR3'beta4, OR52A1) 


Gl:11991866 


(SEQ ID NO:465) 


(SEQ ID 
NO:466) 






Olfactory receptor 52D1 (HOR5'b14, 
HOR5'beta14, OR52D1) 


AAG41676.1
GI:11908211


RTKEIRSRLLKLLHLGKTSI (SEQ 
ID NO:467) 


KTSI 
(SEQ ID 
NO:468) 


X 




Olfactory receptor 5F1 (OlfR5F1, Olfactory
receptor 11-10, OR11-10) 


O95221
GI:14423782


KEVKKALANVISRKRTSSFL 
(SEQ ID NO:469) 


SSFL 
(SEQ ID 
NO:470) 


X

Olfactory receptor 5I1 (OlfR5I1, Olfactory
receptor-like protein OLF1) 


U56420.1
GI:1336040


KNKDVKDAAbKVLRbKVDbb 
(SEQ ID NO:471) 


\/ncc 
VUbb 

(SEQ ID 

NO:472) 






Olfactory receptor 5U1 (OlfR5U1, Hs6M1-28).


XM_167134.2
Gl:22059864 


M Lb K b b LP U K KMC L KA M r KL 
(SEQ ID NO:473) 


MrKL 
(SEQ ID 
NO:474) 


X 




Olfactory receptor 5V1 (OlfR5V1, Hs6M1-21).


CAB65797.1 
Gl:6691937 


KTlGbKWUPPISSLDSKLTY 
(SEQ ID NO:475) 


KLTY
(SEQ ID 
NO:476) 


X 




Olfactory receptor 6A1 (OlfR6A1, Olfactory
receptor 11-55, OR11-55)


AF065870.1 
Gl:3831610 


CILHLYQHQDPDPKKGSRNV
(SEQ ID NO:477) 


SRNV 
(SEQ ID 
NO:478) 


X 




Olfactory receptor 6B1 (OlfR6B1, Olfactory
receptor 7-3, OR7-3, OR6B1) 


AAC64377.1 
Gl:3766132 


NRbVKEALKKLAYCQAbRbD 
(SEQ ID NO:479) 


cncr\ 
bRbL> 

(SEQ ID 
NO:480) 






Olfactory receptor 7A10 (OlfR7A10, OST027) 


AAC25627.1 
Gl:3290001 


YSLRNKHIKGAMKTFFRGKQ 
(SEQ ID NO:481) 


RGKQ 
(SEQ ID 
NO:482) 






Olfactory receptor 7A17 (OlfR7A17)


AAB82060.1 
Gl:2447219 


YSLRNKDIKRALKMSFRGKQ 
(SEQ ID NO:483) 


RGKQ 
(SEQ ID 
NO:484)






Olfactory receptor 7A5 (OlfR7A5, Olfactory 
receptor TPCR92). 


Y10530.1
Gl:2792017 


ALGIHLLWGTMKGQFFKKCP 
(SEQ ID NO:485) 


KKCP

(SEQ ID 
NO:486) 






Olfactory receptor 7C1 (OlfR7C1, Olfactory 
receptor TPCR86). 


AAC25625.1
Gl:3289999 


LGRLLSRATFFNGDITAGLS 
(SEQ ID NO:487) 


AGLS
(SEQ ID 
NO:488) 






Olfactory receptor 7C2 (OlfR7C2, Olfactory 
receptor 19-18, OR19-18) 


AAC15755.1
Gl:3108023 


LGRLLLRATSLKEGTIAKLS (SEQ
ID NO:489) 


AKLS 
(SEQ ID 
NO:490) 






Olfactory receptor 89 (OlfR89) 



AJ132194.1 
Gl:4160227 


NVKGALRNLVRSISALbDbo 
(SEQ ID NO:491) 


bUob 

(SEQ ID 
NO:492) 


X 




Olfactory receptor 8B8 (OlfR8B8, Olfactory 
receptor TPCR85, olfactory-like receptor 
JCG8) 


AF238488.1
GI:17016318


LRNKDVKVALKKILNKNAFb 
(SEQ ID NO:493) 


M A CO 

NArb 

(SEQ ID 
NO:494) 






Olfactory receptor 8D2 (OlfR8D2, Olfactory 
receptor-like protein JCG2) 


AF162668.1 
Gl:12002781 


LRNKDVKNALKKMTRGRQSS 
(SEQ ID NO:495) 


RQSS 
(SEQ ID 
NO:496) 






Olfactory receptor H17 (OlfRH17)



AAG45208.1 
Gl:12007438 


CTLHLYQHQDPDPKKASRNV 
(SEQ ID NO:497) 


SRNV 
(SEQ ID 
NO:498) 


X 




Opioid receptor mu 1 (m1OpioiR)


CAC15482.1
GI:11128469


RDHPSTANTVDRTNHQVRSL 
(SEQ ID NO:499) 


VRSL 
(SEQ ID 
NO:500) 


X 




Opioid receptor type delta (dIOpioiR, DOR-1) 


Gl:501144 


a r p p\/t a np c n n prsr; r^ a a a 
(SEQ ID NO:501) 


(^AAA 

(SEQ ID 
NO:502) 






Opioid receptor type kappa (klOpioiR, KOR-1)


rm*) 1 

Ul 1 U00.1 

Gl:532059 


kin l VkJL/rMT Lr\UiUoiviiNr\rv 
(SEQ ID NO:503) 


INr\r V 

(SEQ ID 
NO:504) 


X 




Opioid receptor type kappa 3 (k3OpioiR,
Nociceptin receptor, Orphanin FQ receptor,
kappa-type 3 opioid receptor, KOR-3) 


Y771 ^n 1 

Gl:471316 


^lA^nWAI APKTCFTVPRPA 
Olnr\U vnLAL»i\ 1 OL_ 1 Vrr\rn 

(SEQ ID NO:505) 


PRPA 
(SEQ ID 
NO:506) 


X 




Opioid receptor type mu (mOpioiR, MOR-1)



AAA20580.1 
Gl:452073 


TVDRTNHQLbNLEAETAPLP 
(SEQ ID NO:507) 


APLP 
(SEQ ID 
NO:508) 






Opsin 3 (Encephalopsin, Panopsin) 


AF140242.1
Gl:4894951 


VDDSDKTNGSKVDVIQVRPL 
(SEQ ID NO:509) 


VRPL
(SEQ ID 
NO:510) 


X 




Opsin 4 (Melanopsin) 


AF147788.1
Gl:6693700 


HEAbTPGKTKGLIPSQDPRM 
(SEQ ID NO:511) 


DPRM
(SEQ ID 
NO:512) 






Orexin receptor type 1 (Ox1r, Hypocretin 
receptor type 1)


Gl:2897123 


OQ|Cl/'|Qpl_|\ A/I TQ\/TT\/I P /CPH 
OoloMotnVVL 1 0 V 1 1 VLr ^ocvJ 

ID NO:513) 


T\/l P 
1 VLr 

(SEQ ID 
NO:514) 






Orexin receptor type 2 (Ox2r, OX2R, 
Hypocretin receptor type 2) 


AF041245.1 
Gl:2897127 


VLTSISTLPAANGAGPLQNW 
(SEQ ID NO:515) 


LQNW 
(SEQ ID 
NO:516) 






Oxytocin receptor (OT-R, OxytocR) 


X64878.1 
Gl:34764 


SFVLSHRSSSQRSCSQPSTA 
(SEQ ID NO:517)


PSTA 
(SEQ ID 
NO:518) 


X

P2Y purinoceptor 1 (P2Y1R, ATP receptor,
P2Y1, Purinergic receptor) 


Z49205.1 
Gl:798835 



SEDMTLNILPEFKQNGDTSL 
(SEQ ID NO:519) 


DTSL 
(SEQ ID 
NO:520) 


X 


AA330 


P2Y purinoceptor 10 (P2Y10, P2Y-like
receptor) 


AF000545.1 
Gl:2104786 


GSSVTRSRLMSKESGSSMIG
(SEQ ID NO:521)


SMIG 
(SEQ ID 
NO:522) 






P2Y purinoceptor 11 (P2Y11)


AJ298334.1 
GI:12964589


PLNATAAPKPSEPQSRELSQ 
(SEQ ID NO:523) 


ELSQ 
(SEQ ID 
NO:524) 


X 




P2Y purinoceptor 2 (P2Y2, P2U purinoceptor 
1, P2U1, ATP receptor, Purinergic receptor) 


U07225.1 
GI:984506


DFRRTESTPAGSENTKDIRL 
(SEQ ID NO:525) 


DIRL 
(SEQ ID 
NO:526) 


X 




P2Y purinoceptor 4 (P2Y4, Uridine nucleotide 
receptor, UNR, P2P) 


X91852.1
GI:1124904


CRWAATPQDSSCSTPRADRL 
(SEQ ID NO:527) 


ADRL 
(SEQ ID 
NO:528) 


X 




P2Y purinoceptor 5 (P2Y5, Purinergic 
receptor 5, RB intron encoded G-protein 
coupled receptor) 


AF000546.1 
Gl:2232068 


FIQHNLQTLKSKIFDNESAA (SEQ 
ID NO:529) 


ESAA 
(SEQ ID 
NO:530) 


X 




P2Y purinoceptor 7 (P2Y7, Leukotriene B4 
receptor, Chemoattractant receptor-like 1)


U41070.1
GI:1469913


EPGPSESLTASSPLKLNELN 
(SEQ ID NO:531)


NELN 
(SEQ ID 
NO:532) 






P2Y purinoceptor 9 (P2Y9R, Purinergic
receptor 9, GPCR GPR23, P2Y5-like receptor) 


U66578.1
GI:1753100


EEVSDQTTNNGGELMLESTF 
(SEQ ID NO:533) 


ESTF 
(SEQ ID 
NO:534) 


X




Parathyroid hormone receptor precursor

(PTH2Rpre, PTH2 receptor, PTHR2) 


U25128.1
Gl:887966 


RPMESNPDTEGCQGETEDVL 
(SEQ ID NO:535) 


EDVL 
(SEQ ID 
NO:536) 


X


AA268 


Parathyroid hormone/parathyroid hormone-
related peptide receptor precursor (PTHRpre,
PTH/PTHR receptor, PTHR1, PTHR, PTRR)


L04308.1 
GI:190721


EASGPERPPALLQEEWETVM
(SEQ ID NO:537)


ETVM 
(SEQ ID 
NO:538) 


X 




Peropsin (Visual pigment-like receptor 
peropsin) 


AF012270.1 


Gl:2307009 


PVTSILPMDVSQNPLASGRI 
(SEQ ID NO:539) 


SGRI 
(SEQ ID 
NO:540) 


X 




Pituitary adenylate cyclase activating 
polypeptide type I receptor precursor 
(PACAPRIp, ADCYAP1R1, PACR, PACAP 
type I receptor) 


NP_001109.1

Gl:4501923 


LSKSSSQIRMSGLPADNLAT 
(SEQ ID NO:541) 


NLAT 
(SEQ ID 
NO:542) 






Platelet activating factor receptor (PAF-R) 


M80436.1 
Gl:189537 


DTVTEVVVPFNQIPGNSLKN
(SEQ ID NO:543) 


SLKN 
(SEQ ID 
NO:544)






Probable G protein-coupled receptor GPR32 


AF045764.1 
Gl:3282838 


RAFGEEEFLSSCPRGNAPRE 
(SEQ ID NO:545) 


APRE 
(SEQ ID 
NO:546) 






Probable G protein-coupled receptor GPR35 
(GPCR35) 


AF027957.1 
Gl:2739108 


AVAPRAKAHKSQDSLCVTLA
(SEQ ID NO:547) 


VTLA 
(SEQ ID 
NO:548) 


X 




Probable G protein-coupled receptor GPR72 
precursor (GPR72pre, GPR72, KIAA1540) 


AF236081.1
Gl:7248881 


SQLQSGKTDLSSVEPIVTMS 
(SEQ ID NO:549) 


VTMS 
(SEQ ID 
NO:550) 






Prostacyclin receptor (Prostanoid IP receptor, 
PGI receptor, PTGIR, PRIPR) 


L29016.1 
Gl:4950*42 


SGSAVGTSSKAEASVACSLC
(SEQ ID NO:551) 


CSLC 
(SEQ ID 
NO:552) 


X 




Prostanoid DP receptor, PGD receptor)


GI:2495009


IRPl RYR^RP^N^TNMFS^l 
(SEQ ID NO:553) 


ESSL 
(SEQ ID 
NO:554) 


X 




Prostaglandin E2 receptor, EP1 subtype
(PE2Rep1, Prostanoid EP1 receptor, PGE
receptor, EP1 subtype, PE21, PTGER1). 


L22647.1 
Gl:410208 


P^AWFAS^LRS^RHSGLSHF 
(SEQ ID NO:555) 


LSHF 
(SEQ ID 
NO:556) 


X 




Prostaglandin E2 receptor, EP2 subtype

(PE2Rep2, PTGER2, Prostanoid EP2 
receptor, PGE receptor, EP2 subtype). 


Gl:639719 


QDATQTSCSTQSDASKQADL
(SEQ ID NO:557) 


QADL 
(SEQ ID 
NO:558) 


X 




Prostaglandin E2 receptor, EP3 subtype 
(PE2Rep3, Prostanoid EP3 receptor, PGE 
receptor, EP3 subtype) 



U13218.1 
Gl:532745 


STSLPCQCSSTLMWSDHLER 
(SEQ ID NO:559) 


HLER 
(SEQ ID 
NO:560) 






Prostaglandin E2 receptor, EP4 subtype 
(PE2ep4, Prostanoid EP4 receptor, 
PGE receptor, EP4 subtype).


AAA QC/CJyl -1 

Gl:452496 


PCCI r>\/TCDCCTI KI) cci/n 

(SEQ ID NO:561)


tft.L»l 

(SEQ ID 
NO:562) 


X 




Prostaglandin EP3 receptor (ProsEP3R) 


BAA19952.1
Gl:2114191 


LPLTLASFKLLREPCSVQLS 
(SEQ ID NO:563) 


VQLS 
(SEQ ID 
NO:564) 






Prostaglandin EP3 receptor subtype isoform 
(PEP3isof) 


D86097.1 
Gl:2102644 


QVPRTWCSSHDREPCSVQLS 
(SEQ ID NO:565) 


VQLS 
(SEQ ID 
NO:566)

Prostaglandin F2-alpha receptor (PF2aR,
Prostanoid FP receptor, PGF receptor, PGF2 
alpha receptor, PTGFR) 


AF004021.1 
Gl:2257849 


NoLrSVAAIbborVAbKoAo I 
(SEQ ID NO:567) 


O A CT 

bAb 1 
(SEQ ID 
NO:568) 






Proteinase activated receptor 1 precursor

(PAR-1, Thrombin receptor, Coagulation 
factor II receptor) 


M62424.1
GI:339676


SKMDTCSSNLNNSIYKKLLT 
(SEQ ID NO:569) 


KLLT 
(SEQ ID 
NO:570) 






Proteinase activated receptor 2 precursor

(PAR-2, Thrombin receptor-like 1,
Coagulation factor II receptor-like 1) 


Gl:1008084 


KH^RK^^Y^^STTVKTSY 
(SEQ ID NO:571)


KTSY 
(SEQ ID 
NO:572) 


X 




Proteinase activated receptor 3 precursor

(PAR-3, Thrombin receptor-like 2,
Coagulation factor II receptor-like 2) 


IJQOQ71 1 

GI:1938374


PFI YFL M^KTRNHSTAYl TK 
(SEQ ID NO:573) 


YLTK 
(SEQ ID 
NO:574) 






Proteinase activated receptor 4 precursor 
(PAR-4, Thrombin receptor-like 3,
Coagulation factor II receptor-like 3) 



AF080214.1 
Gl:3396080 


SKASAEGGSRGMGTHSSLLQ
(SEQ ID NO:575) 


SLLQ 
(SEQ ID 
NO:576) 


X 




Putative G protein-coupled receptor 54 
(GPCR54, GPR54) 


AB051065.1
GI:14041797


GSSGLAARGLCVLGEDNAPL
(SEQ ID NO:577) 


NAPL 
(SEQ ID 
NO:578) 


X




Putative G protein-coupled receptor 92 
(GPCR92) 


AJ272207.1 
Gl:9843745 


RPSDSHSLSSFTQCPQDSAL 
(SEQ ID NO:579) 


DSAL 
(SEQ ID 
NO:580) 


X




Putative G protein-coupled receptor GPR44 
(Chemoattractant receptor-homologous
molecule expressed on Th2 cells) 


AB008535.1 
Gl:4204215 


SCAASPQTGPLNRALSSTSS 
(SEQ ID NO:581)


STSS 
(SEQ ID 
NO:582) 






Putative G-Protein coupled receptor, EDG6 
precursor (EDG6pre, Hypothetical protein). 


AJ000479.1 
Gl:3805931 


RSLSFRMREPLSSISSVRSI (SEQ 
ID NO:583) 


VRSI 
(SEQ ID 
NO:584) 


X 




Red-sensitive opsin (Red cone photoreceptor 
pigment) 


M13300.1 
GI:180696


SELSSASKTEVSSVSSVSPA
(SEQ ID NO:585) 


VSPA 
(SEQ ID 
NO:586) 


X 




Retinal G protein coupled receptor 


BC011349.1
GI:15030185


VCRG IWQCLSPQKKt KDKTK 
(SEQ ID NO:587) 


DRTK 
(SEQ ID 
NO:588) 






Rhodopsin (Opsin2)


AAC31763.1 
GI:1236137


GDDEASATVSKTETSQVAPA
(SEQ ID NO:589) 


VAPA 
(SEQ ID 
NO:590) 


X 




Secretin receptor precursor (SecrRpre, SCT-R).


U20178.1
Gl:662795 


NSTKASHLEQSQGTCRTSII 
(SEQ ID NO:591) 


TSII

(SEQ ID 
NO:592) 


X




Serotonin receptor 5-hydroxytryptamine 1A 
(5HT1A, G-21, ser-5-hydroxytryptamine 1A 
receptor) 


M28269.1 
GI:189927


FNKDFQNAFKKIIKCKFCRQ 
(SEQ ID NO:593) 


FCRQ 
(SEQ ID 
NO:594) 






Serotonin receptor 5-hydroxytryptamine 1B 
(5HT1B, ser- 5-hydroxytryptamine 1B rec, 5- 
HT-1D-beta, serotonin 1D beta receptor, S12) 


D10995.1
Gl:219678 


MSNEDFKQAFHKLIRFKCTS 
(SEQ ID NO:595) 


KCTS 
(SEQ ID 
NO:596) 






Serotonin receptor 5-hydroxytryptamine 1D
(5HT1D, serotonin receptor 5-HT-1D-alpha, 
HTR1D) 


M89955.1 
Gl:177771 


VFNEEFRQAFQKIVPFRKAS 
(SEQ ID NO:597) 


RKAS 
(SEQ ID 
NO:598) 






Serotonin receptor 5-hydroxytryptamine 1E 
(5HT1E, serotonin receptor 5-HT1E, S31) 


M91467.1 
GI:177773


SFNEDFKLAFKKLIRCREHT 
(SEQ ID NO:599) 


REHT 
(SEQ ID 
NO:600) 






Serotonin receptor 5-hydroxytryptamine 1F (5- 
HT-1F, serotonin receptor 5HT1F) 


L05597.1 
Gl:307419 


YTIFNEDFKKAFQKLVRCRC 
(SEQ ID NO:601) 


RCRC 
(SEQ ID 
NO:602) 


X 




Serotonin receptor 5-hydroxytryptamine 2A (5- 
HT-2A, serotonin receptor 5HT2A) 


S42168.1
Gl:252946 


riSEEASKDNSDGVNEKVSCV 
(SEQ ID NO:603) 


VSCV

(SEQ ID 
NO:604) 


X 




Serotonin receptor 5-hydroxytryptamine 2B (5- 
HT-2B, serotonin receptor 5HT2B) 



X77307.1 
Gl:475197 


DTLLLTbNEGDKTEEQVSYV 
(SEQ ID NO:605) 


VSYV 
(SEQ ID 
NO:606) 


X 


AA233L 


Serotonin receptor 5-hydroxytryptamine 2C 
(5-HT-2C, serotonin receptor 5HT2C) 


M81778.1 
GI:338027


ENLELPVNPSSVVSERISSV
(SEQ ID NO:607) 


ISSV 
(SEQ ID 
NO:608) 


X 


AA205L 


Serotonin receptor 5-hydroxytryptamine 4 (5-
HT-4, serotonin receptor 5HT4)


vi 9^n^ 1 
Gl:2661756 


FQnPMDDATQPI \/AAHPQnT 
toUUnrrA i OrL VAMvJroLf 1 

(SEQ ID NO:609) 


roU 1 

(SEQ ID 
NO:610) 






Serotonin receptor 5-hydroxytryptamine 5A (5- 
HT-5A, serotonin receptor 5HT5A) 


X81411.1 
Gl:541776 


YTAFNKNYNSAFKNFFSRQH
(SEQ ID NO:611)


SRQH 
(SEQ ID 
NO:612) 






Serotonin receptor 5-hydroxytryptamine 6 (5- 
HT-6, serotonin receptor 5HT6) 


L41147.1
GI:1162923


FNIDPAEPELRPHPLGIPTN (SEQ 
ID NO:613) 


IPTN 
(SEQ ID 
NO:614)

Serotonin receptor 5-hydroxytryptamine 7 (5-
HT-7, serotonin receptor 5HT7, 5HTX) 


U68488.1
Gl:1857144 



H N WLAD KM LTT VE KKVM 1 H D 
(SEQ ID NO:615) 


» « i i_i r> 

(SEQ ID 
NO:616) 






Smoothened homolog precursor (SMOpre, 
SMO, Gx protein). 


U84401.1
GI:1813875


PIHSRTNLIvlDTELIvlDADoDF 
(SEQ ID NO:617)


DbDr 
(SEQ ID 
NO:618) 


X 




Somatostatin receptor type 1 (SS1R, SSR1, 
SSTR1, SRIF-2) 


M81829.1 
GI:307433


NLESGGVFRNGTCTSRITTL 
(SEQ ID NO:619) 


ITTL 
(SEQ ID 
NO:620) 


X 




Somatostatin receptor type 2 (SSR2, SS2R, 
SSTR2, SRIF-1).


M81830.1 
Gl:307435 



LNETTETQRTLLNGDLQTSI 
(SEQ ID NO:621) 


QTSI 
(SEQ ID 
NO:622) 


X 


AA1 13 


Somatostatin receptor type 3 (SSR3, SS3R, 
SSR-28) 


M96738.1 
Gl:338498 


LLPQEASTGEKSSTMRISYL 
(SEQ ID NO:623) 


ISYL 
(SEQ ID 
NO:624) 


X 




Somatostatin receptor type 4 (SS4R, SSTR4) 


D16826.1
Gl:693907 


EALQPEPGRKRIPLTRTTTF
(SEQ ID NO:625) 


TTTF
(SEQ ID 
NO:626) 


X 


AA248 


Somatostatin receptor type 5 (SS5R, SSTR5) 


AAK61266.1
GI:14336736



EATPPAHRAAANGLMQTSKL 
(SEQ ID NO:627) 


TSKL 
(SEQ ID 
NO:628) 


X 




Sphingosine 1-phosphate receptor Edg-8
(SPPR) 


AF317676.1
GI:11559845


TGSPGAPTAARTLVSEPAAD
(SEQ ID NO:629) 


PAAD 
(SEQ ID 
NO:630) 






Substance-K receptor (NKinin2R, SKR, 
Neurokinin A receptor, NK-2R) 


M57414.1 
Gl:189134 


GSGLWFGYGLLAPTKTHVEI
(SEQ ID NO:631) 


HVEI 
(SEQ ID 
NO:632) 


X 




Substance-P receptor (SPR, NK-1 receptor, 
NK-1R) 


S62045.1 
Gl:237994 


SRSDSKTMTESFSFSSNVLS
(SEQ ID NO:633) 


NVLS 
(SEQ ID 
NO:634) 






Thromboxane A2 receptor (TBXA2R, TXA2-R, 
Prostanoid TP receptor) 


U11271.1
Gl:511793 


ASRVQAILVPQPPEQLGLQA
(SEQ ID NO:635) 


GLQA 
(SEQ ID 
NO:636) 






Thyrotropin receptor precursor (TSH-R, 
Thyroid stimulating hormone receptor) 


M73747.1 
Gl:903759 


SHLTPKKQGQISbEYMQTVL 
(SEQ ID NO:637) 


QTVL 
(SEQ ID 
NO:638) 


X 




Thyrotropin-releasing hormone receptor 
(TRH-R, Thyroliberin receptor) 


D16845.1
Gl:577631 


ATKVSFDDTCLASEVSFSQS 
(SEQ ID NO:639) 


FSQS 
(SEQ ID 
NO:640) 






Trace amine receptor 1 (AmineR1)


AF380185.1
Gl:14600073 


FGKIFQKDSSRCKLFLELSS
(SEQ ID NO:641) 


ELSS 
(SEQ ID 
NO:642) 






Trace amine receptor 3 (AmineR3) 


AF380189.1
Gl:14600081 



KVLRTDSSTTNLFSEEVETD 
(SEQ ID NO:643) 


VETD 
(SEQ ID 
NO:644) 






Trace amine receptor 4 (AmineR4) 


AF380192.1 
Gl:14600087 


VTGQVLKNSSATMNLFSEHI
(SEQ ID NO:645) 


SEHI 
(SEQ ID 
NO:646) 


X 




Trace amine receptor 5 (TA5, GPR102) 


AF411116.1 
GI:16566343


LILSGDVLKASSSTISLFLE (SEQ 
ID NO:647) 


LFLE 
(SEQ ID 
NO:648) 






Urotensin II receptor (UR-II-R) 


AF140631.1 
Gl:5902615 


LVLAPAAPARPAPEGPRAPA 
(SEQ ID NO:649) 


RAPA 
(SEQ ID 
NO:650) 


X 




Vasoactive intestinal polypeptide receptor 1
precursor (VIPR1, Pituitary adenylate cyclase
activating polypeptide type II receptor, PACAP
type II receptor, PACAP-R-2)


1111 Aft7 1 

GI:508258


1 r\ VorunrxrvOoorUnL VOL V 

(SEQ ID NO:651)


VOL. V 

(SEQ ID 
NO:652) 


X 




Vasoactive intestinal polypeptide receptor 2 
precursor (VIPR2pre, VIP-R-2, Pituitary 
adenylate cyclase activating polypeptide type 
III receptor, PACAP type III receptor, PACAP-
R-3, Helodermin-preferring VIP receptor)


L40764.1 
Gl:712836 


LQFHRGSRAQSFLQTETSVI
(SEQ ID NO:653) 


TSVI 
(SEQ ID 
NO:654) 


X 


AA329 


Vasopressin receptor type 2 (VasoprR2) 


AF032388.1 


VQLWAAWDPEAPLEGGCSRG 
(SEQ ID NO:655)


CSRG 
(SEQ ID
NO:656)






Vasopressin V1a receptor (V1aR,
Vascular/hepatic-type arginine vasopressin 
receptor, Antidiuretic hormone receptor 1a, 
AVPR V1a)


AAA62271.1 
GI:667068


GMWKDSPKSSKSIKFIPVST 
(SEQ ID NO:657) 


PVST 
(SEQ ID 
NO:658) 






Vasopressin V1b receptor (V1bR, AVPR V1b, 
Vasopressin V3 receptor, AVPR V3, 
Antidiuretic hormone receptor 1b) 


D31833.1
Gl:563981 


ESPRDLELADGEGTAETIIF (SEQ 
ID NO:659) 


TIIF 

(SEQ ID 
NO:660) 


X

Vasopressin V2 receptor (Renal-type arginine
vasopressin receptor, Antidiuretic hormone 
receptor, AVPR V2) 


U04357.1 
Gl:3004498 


GPQDESCTTASSSLAKDTSS 
(SEQ ID NO:661) 


DTSS 
(SEQ ID 
NO:662) 






Vomeronasal receptor 1 (VomNasRI, Putative 
pheromone receptor V1RL1 long form, 
VNR19I1, V1RL1). 


AF302903.1 
Gl:10732801 


QFCFACRTRKTLFPNLVVMP
(SEQ ID NO:663) 


VVMP
(SEQ ID 
NO:664) 






Y6 encoding protein (Y6 protein) 


D86519.1 
GI:1731789


GACWLPRISSMSSLTGIMRC 
(SEQ ID NO:665) 


IMRC 
(SEQ ID 
NO:666) 


X

Table 4.

GPCR gene                                 PDZ-containing gene   PDZ Domain(s)
alpha 1A-Adrenergic receptor              nNOS
beta2-Adrenergic receptor (DSLL)          EBP50                 1
beta2-Adrenergic receptor (DSLL)          SIP1                  1
P2Y1 purinergic receptor (DTSL)           EBP50                 1
GRK6A (TRL)                               EBP50                 1
beta1-Adrenergic receptor (DSLL)          rat PSD95             3
parathyroid hormone 1 receptor            SIP1                  2
parathyroid hormone 1 receptor            EBP50                 na
5HT2B                                     cNOS
platelet-derived growth factor receptor   EBP50
mGLUR1a                                   shank
mGLUR5                                    shank
SSTR2                                     shank1
SSTR2                                     shank2
IL8RB                                     RGS12
CL1 (a-latrotoxin)                        shank
5HT2B                                     Inadl                 6
B1AR                                      MAGI2                 1
rat SSTR2                                 CBP1
5HT2C                                     MUPP1                 10
SSTR2A                                    CBP1
CIRL1                                     shank2
CIRL2                                     shank2
CIRL1 & 2                                 shank family
prolactin-releasing peptide receptor      GRIP
prolactin-releasing peptide receptor      ABP
prolactin-releasing peptide receptor      PICK1
kappa opioid receptor                     EBP50                 1
mGLUR7                                    PICK1

Table 6.



Gene Name 


GI or Acc#


Domain# 


Sequence fused to GST Construct 


26s subunit p27 


9184389 


1 


RDMAEAHKEAMSRKLGQSESQGPPRAFAKVNSISPGSPSIAGLQVDDEIVEF 
GSVNTQNFQSLHNIGSWQHSEGALAPTILLSVSM (SEQ ID NO:667) 


AF6 


430993 


1 


LRKEPEIITVTLKKQNGMGLSIVAAKGAGQDKLGIYVKSWKGGAADVDGRLAA 

pnni i c v /rk/"* d o i chcdaaci mtdtcc\a/ti cwAkrv* /ceo in Mrvccox 
oUULLoVUuKoLVoLoUtKAAtLIVl 1 K 1 ooVV 1 LfcVAIMJlj (SEQ ID NO:668)


AIPC 


12751451 


1 


LIRPSVISIIGLYKEKGKGLGFSIAGGRDCIRGQMGIFVKTIFPNGSAAEDGRLKE 
GDEILDVNGIPIKGLTFQEAIHTFKQIRSGLFVLTVRTKLVSPSLTNSS (SEQ ID 
NO:669) 


AIPC 


12751451 


2 


GISSLGRKTPGPKDRIVMEVTLNKEPRVGLGIGACCLALENSPPGIYIHSLAPG 
SVAKMESNLSRGDQILEVNSVNVRHAALSKVHAILSKCPPGPVRLVIGRHPNP 
KVSEQEMDEVIARSTYQESKEANSS (SEQ ID NO:670) 


AIPC 


12751451 


3 


QSENEEDVCFIVLNRKEGSGLGFSVAGGTDVEPKSITVHRVFSQGAASQEGT 
MNRGDFLLSVNGASLAGLAHGNVLKVLHQAQLHKDALWIKKGMDQPRPSNS 
S (SEQ ID NO:671)


AIPC 


12751451 


4 


LGRSVAVHDALCVEVLKTSAGLGLSLDGGKSSVTGDGPLVIKRVYKGGAAEQ 
AGIIEAGDEILAINGKPLVGLMHFDAWNIMKSVPEGPVQLLIRKHRNSS (SEQ 
ID NO:672) 


alpha actinin-2 
associated LIM 
protein 


2773059 


1 


QTVILPGPAAWGFRLSGGIDFNQPLVITRITPGSKAAAANLCPGDVILAIDGFGT 
ESMTHADGQDRIKAAEFIV (SEQ ID NO:673) 


APXL-1 


13651263 


1 


ILVEVQLSGGAPWGFTLKGGREHGEPLVITKIEEGSKAAAVDKLLAGDEIVGIN 
DIGLSGFRQEAICLVKGSHKTLKLWKRNSS (SEQ ID NO:674) 


Atrophin-1 
Interacting 
Protein 


2947231 


1 


REKPLFTRDASQLKGTFLSTTLKKSNMGFGFTIIGGDEPDEFLQVKSVIPDGPA 
AQDGKMETGDVIVYINEVCVLGHTHADWKLFQSVPIGQSVNLVLCRGYP
(SEQ ID NO:675) 


Atrophin-1 
interacting 
Protein 


2947231 


2 


LSGATQAELMTLTIVKGAQGFGFTIADSPTGQRVKQILDIQGCPGLCEGDLIVEI 
NQQNVQNLSHTEWDILKDCPIGSETSLIIHRGGFF (SEQ ID NO:676) 


Atrophin-1 
interacting 
Protein 


2947231 


3 


HYKELDVHLRRMESGFGFRILGGDEPGQPILIGAVIAMGSADRDGRLHPGDEL 
VWDGIPVAGKTHRYVIDLMHHAARNGQVNLTVRRKVLCG (SEQ ID NO:677) 


Atrophin-1 
Interacting 
Protein 


2947231 


4 


EGRGISSHSLQTSDAVIHRKENEGFGFVIISSLNRPESGSTITVPHKIGRIIDGSP 
ADRCAKLKVGDRILAVNGQSIINMPHADIVKLIKDAGLSVTLRIIPQEEL (SEQ ID 
NO:678) 


Atrophin-1 
Interacting 
Protein 


2947231 


5 


LSDYRQPQDFDYFTVDMEKGAKGFGFSIRGGREYKMDLYVLRLAEDGPAIRN 
GRMRVGDQIIEINGESTRDMTHARAIELIKSGGRRVRLLLKRGTGQ (SEQ ID 
NO:679) 


Atrophin-1 
Interacting 
Protein 


2947231 


6 


HESVIGRNPEGQLGFELKGGAENGQFPYLGEVKPGKVAYESGSKLVSEELLL 
EVNETPVAGLTIKDVL>\VlKHCKDPLRLKOVKQ(jC3lnK (SEQ ID NO:680)


CARD11


12382772 





NLMFRKFSLERPFRPSVTSVGHVRGPGPSVQHTTLNGDSLTSQLTLLGGNAR 
GSFVHSVKPGSLAEKAGLREGHQLLLLEGCIRGERQSVPLDTCTKEEAHWTIQ 
RCSGPVTLHYKVNHEGYRKLV (SEQ ID NO:681) 


CARD14


13129123 


1 


ILSQVTMLAFQGDALLEQISVIGGNLTGIFIHRVTPGSAADQMALRPGTQIVMVD 
YEASEPLFKAVLEDTTLEEAVGLLRRVDGFCCLSVKVNTDGYKRL (SEQ ID 
NO:682) 


CASK 


3087815 


1 


TRVRLVQFQKNTDEPMGITLKMNELNHCIVARIMHGGMIHRQGTLHVGDEIREI 
NGISVANQTVEQLQKMLREMRGSITFKIVPSYRTQS (SEQ ID NO:683) 


Connector 
Enhancer 


3930780 




LEQKAVLEQVQLDSPLGLEIHTTSNCQHFVSQVDTQVPTDSRLQIQPGDEWQ 
INEQVWGWPRKNMVRELLREPAGLSLVLKKIPIP (SEQ ID NO:684) 


Cytohesin 
Binding Protein 


3192908 




QRKLVTVEKQDNETFGFEIQSYRPQNQNACSSEMFTLICKIQEDSPAHCAGLQ 
AGDVLANINGVSTEGFTYKQWDLIRSSGNLLTIETLNG (SEQ ID NO:685) 


Densin 180 


16755892 




RCLIQTKGQRSMDGYPEQFCVRIEKNPGLGFSISGGISGQGNPFKPSDKGIFV 
TRVQPDGPASNLLQPGDKILQANGHSFVHMEHEKAVLLLKSFQNTVDLVIQRE 
LTV (SEQ ID NO:686) 


DLG1 


475816 




IQVNGTDADYEYEEITLERGNSGLGFSIAGGTDNPHIGDDSSIFITKIITGGAAAQ 
DGRLRVNDCILQVNEVDVRDVTHSKAVEALKEAGSIVRLYVKRRN (SEQ ID 
NO:687) 


DLG1 


475816 


2 


IQLIKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAAHKDGKLQIGDKLLAV 
NNVCLEEVTHEEAVTALKNTSDFVYLKVAKPTSMYMNDGN (SEQ ID NO:688) 


DLG1 


475816 


3 


ILHRGSTGLGFNIVGGEDGEGIFISFILAGGPADLSGELRKGDRIISVNSVDLRA 
ASHEQAAAALKNAGQAVTIVAQYRPEEYSR (SEQ ID NO:689) 


DLG2 


12736552 


1 


ISYVNGTEIEYEFEEITLERGNSGLGFSIAGGTDNPHIGDDPGIFITKIIPGGAAAE 
DGRLRVNDCILRVNEVDVSEVSHSKAVEALKEAGSIVRLYVRRR (SEQ ID 
NO:690) 
DLG2


12736552 


2 


ISWEIKLFKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIDGGAAQKDGRLQVGD 
RLLMVNNYSLEEVTHEEAVAILKNTSEWYLKVGNPTTI (SEQ ID NO:691)


DLG2 


12736552 


3 


IIA/AX/CI CPCDDIAA/I U W CTP 1 /"'CM \\tr*r* CI^P C/" 1 ICA/CCII Afi/'iDAril Cf^PI o 

IWAVoLcotrKr\VVLnKL>o 1 oLorNI VVjobUotoIr Vbrll^oorAULootLU 
RGDQILSVNGIDLRGASHEQAAAALKGAGQTVTIIAQYQPED (SEQ ID 
NO:692) 


DLG5 


3650451 




GIPYVEEPRHVKVQKGSEPLGISIVSGEKGGIYVSKVTVGSIAHQAGLEYGDQL 
LEFNGINLRSATEQQARLIIGQQCDTITILAQYNPHVHQLRNSSZLTD (SEQ ID 
NO:693) 


DLG5 


3650451 




GILAGDANKKTLEPRWFIKKSQLELGVHLCGGNLHGVFVAEVEDDSPAKGPD 
GLVPGDLILEYGSLDVRNKTVEEVYVEMLKPRDGVRLKVQYRPEEFIVTD 
(SEQ ID NO:694) 


DLG6, splice
variant 1 






PT9PFIOFI ROMI OAPHFKAI 1 ^AHDTIAOKDFEPL LPPLPDNIPFSEEAMRIVC 
LVKNQQPLGATIKRHEMTGDILVARIIHGGLAERSGLLYAGDKLVEVNGVSVEG 
LDPEQVIHILAMSRGTIMFKWPVSDPPVNSS (SEQ ID NO:695) 


DLG6, splice 
variant 2 


AB053303 


1 


PTSPEIQELRQMLQAPHFKGATIKRHEMTGDILVARIIHGGLAERSGLLYAGDK 
LVEVNGVSVEGLDPEQVIHILAMSRGTIMFKWPVSDPPVNSS (SEQ ID
NO:696)


DVL1 


2291005 




LNIVTVTLNMERHHFLGISIVGQSNDRGDGGIYIGSIMKGGAVAADGRIEPGDM
LLQVNDVNFENMSNDDAVRVLREIVSQTGPISLTVAKCW (SEQ ID NO:697)


DVL2 


2291007 




LNIITVTLNMEKYNFLGISIVGQSNERGDGGIYIGSIMKGGAVAADGRIEPGDML 
LQVNDMNFENMSNDDAVRVLRDIVHKPGPIVLTVAKCWDPSPQNS (SEQ ID 
NO:698) 


DVL3 


6806886 




IITVTLNMEKYNFLGISIVGQSNERGDGGIYIGSIMKGGAVAADGRIEPGDMLLQ 
VNEINFENMSNDDAVRVLREIVHKPGPITLTVAKCWDPSP (SEQ ID NO:699) 


ELFIN 1 


2957144 




TTQQIDLQGPGPWGFRLVGRKDFEQPLAISRVTPGSKAALANLCIGDVITAIDG 
ENTSNMTHLEAQNRIKGCTDNLTLTVARSEHKVWSPLV (SEQ ID NO:700) 


ENIGMA 


561636 




IFMDSFKWLEGPAPWGFRLQGGKDFNVPLSISRLTPGGKAAQAGVAVGDVW 
LSIDGENAGSLTHIEAQNKIRACGERLSLGLSRAQPV (SEQ ID NO:701) 


ERBIN 


8923908 


1 


QGHELAKQEIRVRVEKDPELGFSISGGVGGRGNPFRPDDDGIFVTRVQPEGP 
ASKLLQPGDKIIQANGYSFINIEHGQAVSLLKTFQNTVELIIVREVSS (SEQ ID 
NO:702)


EZRIN Binding 
Protein 50 


3220018 


1 


ILCCLEKGPNGYGFHLHGEKGKLGQYIRLVEPGSPAEKAGLLAGDRLVEVNGE 
NVEKETHQQWSRIRAALNAVRLLWDPEFIVTD (SEQ ID NO:703) 


EZRIN Binding 
Protein 50 


3220018 




IRLCTMKKGPSGYGFNLHSDKSKPGQFIRSVDPDSPAEASGLRAQDRIVEVNG 
VCMEGKQHGDWSAIRAGGDETKLLWDRETDEFFMNSS (SEQ ID NO:704) 


FLJ00011 


10440352 


1 


KNPSGELKTVTLSKMKQSLGISISGGIESKVQPMVKIEKIFPGGAAFLSGALQA 
GFELVAVDGENLEQVTHQRAVDTIRRAYRNKAREPMELWRVPGPSPRPSPS 
D (SEQ ID NO:705)


FLJ11215 


11436365 




EGHSHPRWELPKTEEGLGFNIMGGKEQNSPIYISRIIPGGIADRHGGLKRGDQ 
LLSVNGVSVEGEHHEKAVELLKAAQGKVKLWRYTPKVLEEME (SEQ ID 
NO:706) 


FLJ12428


BC012040 




PGAPYARKTFTIVGDAVGWGFWRGSKPCHIQAVDPSGPAAAAGMKVCQFW 
SVNGLNVLHVDYRTVSNLILTGPRTIVMEVMEELEC (SEQ ID NO:707) 


FLJ12615 


10434209 


1 


GQYGGETVKIVRIEKARDIPLGATVRNEMDSVIISRIVKGGAAEKSGLLHEGDE 
VLEINGIEIRGKDVNEVFDLLSDMHGTLTFVLIPSQQIKPPPA (SEQ ID NO:708) 


FLJ20075 


7019938 


1 


ILAHVKGIEKEVNWKSEDSLGLTITDNGVGYAFIKRIKDGGVIDSVKTICVGDHI 
ESINGENIVGWRHYDVAKKLKELKKEELFTMKLIEPKKAFEI (SEQ ID NO:709) 


FLJ21687 


10437836




KrSQASGnrbVbLVKCjYAorvjL 1 LoooKUVAuu 1 rL^VKoLLKUorAUKOo 
RLEVGDLVLHINGESTQGLTHAQAVERIRAGGPQLHLVIRRPLETHPGKPRGV 
(SEQ ID NO:710) 


FLJ31349 


AK055911 


1 


PVMSQCACLEEVHLPNIKPGEGLGMYIKSTYDGLHVITGTTENSPADRSQKIH 
AGDEVIQVNQQTWGWQLKNLVKKLRENPTGWLLLKKRPTGSFNFTPEFIVT
D (SEQ ID NO:711)


FLJ32798 


AK057360 


1 


LDDEEDSVKIIRLVKNREPLGATIKKDEQTGAIIVARIMRGGAADRSGLIHVGDE 
LREVNGIPVEDKRPEEIIQILAQSQGAITFKIIPGSKEETPSNSS (SEQ ID 
NO:712) 


GRIP 1 


4539083 


1 


WELMKKEGTTLGLTVSGGIDKDGKPRVSNLRQGGIAARSDQLDVGDYIKAVN 
GINLAKFRHDEIISLLKNVGERWLEVEYE (SEQ ID NO:713) 


GRIP 1 


4539083 


2 


RSSVIFRTVEVTLHKEGNTFGFVIRGGAHDDRNKSRPWITCVRPGGPADREG 
TIKPGDRLLSVDGIRLLGTTHAEAMSILKQCGQEAALLIEYDVSVMDSVATASG 
NSS (SEQ ID NO:714) 


GRIP 1 


4539083 


3 


HVATASGPLLVEVAKTPGASLGVALTTSMCCNKQVIVIDKIKSASIADRCGALH
VGDHILSIDGTSMEYCTLAEATQFLANTTDQVKLEILPHHQTRLALKGPNSS 
(SEQ ID NO:715) 


GRIP 1 


4539083 


4 


TETTEWLTADPVTGFGIQLQGSVFATETLSSPPLISYIEADSPAERCGVLQIGD 
RVMAINGIPTEDSTFEEASQLLRDSSITSKVTLEIEFDVAES (SEQ ID NO:716) 
GRIP 1


4539083 


5 


AESVIPSSGTFHVKLPKKHNVELGITISSPSSRKPGDPLVISDIKKGSVAHRTGT 
LELGDKLLAIDNIRLDNCSMEDAVQILQQCEDLVKLKIRKDEDNSD (SEQ ID 
NO:717) 


GRIP 1 


4539083 


6 


IYTVELKRYGGPLGITISGTEEPFDPIIISSLTKGGLAERTGAIHIGDRILAINSSSL 
KGKPLSEAIHLLQMAGETVTLKIKKQTDAQSA (SEQ ID NO:718)


GRIP 1 


4539083 


7 


IMSPTPVELHKVTLYKDSDMEDFGFSVADGLLEKGVYVKNIRPAGPGDLGGLK 
PYDRLLQVNHVRTRDFDCCLWPLIAESGNKLDLVISRNPLA (SEQ ID 
NO:719)


GTPase 

Activating 

Enzyme 


2389008 


1 


SRGCETRELALPRDGQGRLGFEVDAEGFVTHVERFTFAETAGLRPGARLLRV 
CGQTLPSLRPEAAAQLLRSAPKVCVTVLPPDESGRP (SEQ ID NO:720) 


Guanine 

Exchange Factor 


6650765 


1 


AKAKWRQWLQKASRESPLQFSLNGGSEKGFGIFVEGVEPGSKAADSGLKRG 
DQIMEVNGQNFENITFMKAVEILRNNTHLALTVKTNIFVFKEL (SEQ ID NO:721) 


HEMBA 
1000505 


10436367 


1 


LENVIAKSLLIKSNEGSYGFGLEDKNKVPIIKLVEKGSNAEMAGMEVGKKIFAIN 
GDLVFMRPFNEVDCFLKSCLNSRKPLRVLVSTKP (SEQ ID NO:722) 


HEMBA 
1000505 


10436367 




PRETVKIPDSADGLGFQIRGFGPSWHAVGRGTVAAAAGLHPGQCIIKVNGINV 
SKETHASVIAHVTACRKYRRPTKQDSIQ (SEQ ID NO:723) 


HEMBA
1003117


7022001 




cDrCYVrTVcLcRGrSoLoMbLIUbMnl HLoArC^LYIU I LLPCjorAAAUljKLo 
LGDRILEVNGSSLLGLGYLRAVDLIRHGGKKMRFLVAKSDVETAKKI (SEQ ID 
NO:724) 


HTRA3


AY040094




LTcrUUKUIKDWI\KRrlolKMRTITPoLVUtLKAoNrU 

SPSQRGGIQDGDIIVKVNGRPLVDSSELQEAVLTESPLLLEVRRGNDDLLFSNS 
S (SEQ ID NO:725) 


HTRA4 


AL576444 




HKKYLGLQMLSLTVPLSEELKMHYPDFPDVSSGVYVCKWEGTAAQSSGLRD 
HDVIVNINGKPITTTTDWKALDSDSLSMAVLRGKDNLLLTVNSS (SEQ ID 
NO:726) 


INADL 


2370148 


1 


IWQIEYIDIERPSTGGLGFSWALRSQNLGKVDIFVKDVQPGSVADRDQRLKEN 
DQILAINHTPLDQNISHQQAIALLQQTTGSLRLIVAREPVHTKSSTSSSE (SEQ 
ID NO:727) 


INADL 


2370148 


2 


PGHVEEVELINDGSGLGFGIVGGKTSGVWRTIVPGGLADRDGRLQTGDHILKI 
GGTNVQGMTSEQVAQVLRNCGNSS (SEQ ID NO:728) 


INADL


2370148


3


Df'CnCCI rcTVM\/CI \/Dl/nPPlCI r k IDI\/r»V\/PTCUTr'CACr k IWl^CIIDr'CAAV 

rbbuooLrt 1 YNVbLVKKUL>UoLolKI VoY Vo 1 on 1 UbAoolY VrvollrVjoAAY 
HNGHIQVNDKIVAVDGVNIQGFANHDWEVLRNAGQWHLTLVRRKTSSSTSR 
IHRD (SEQ ID NO:729) 


INADL


2370148


4


NoUUAbLUKYbKLLrln 1 LKLCWbVUbrUonHYIbblvbVjorVU 1 LCjLLUrbUb 
LLEVNGMQLYGKSRREAVSFLKEVPPPFTLVCCRRLFDDEAS (SEQ ID 
NO:730) 


INADL 


2370148


5 


LbbrbVKIVbLVKUoKoLQjrblLDYUUrLUr I Kb VI VIKoLVAUls VAtKoubLLr 
GDRLVSVNEYCLDNTSLAEAVEILKAVPPGLVHLGICKPLVEFIVTD (SEQ ID 
NO:731) 


INADL 


2370148 


6 


PNFSHWGPPRIVEIFREPNVSLGISIWGQTVIKRLKNGEELKGIFIKQVLEDSPA 
GKTNALKTGDKILEVSGVDLQNASHSEAVEAIKNAGNPWFIVQSLSSTPRVIP 
NVHNKANSS (SEQ ID NO:732) 


INADL 


2370148 


7 


PGELHIIELEKDKNGLGLSLAGNKDRSRMSIFWGINPEGPAAADGRMRIGDEL 
LEINNQILYGRSHQNASAIIKTAPSKVKLVFIRNEDAVNQMANSS (SEQ ID 
NO:733) 


INADL 


2370148 


8 


PATCPIVPGQEMIIEISKGRSGLGLSIVGGKDTPLNAIVIHEVYEEGAAARDGRL 
WAGDQILEVNGVDLRNSSHEEAITALRQTPQKVRLWY (SEQ ID NO:734) 


KIAA0147


1469875


1


II XI Til DHTr'n PICIAPOI/r'eTDVI/PnnCPICICDV/OCCODA ADA^\/D\/Pni/l 

ILI LI ILKU I oIjUjIoIAooKvjo I rYKoUUbolNbKVbbbtjrAAKAvjVKVvjUKL 
LEVNGVALQGAEHHEAVEALRGAGTAVQMRVWRERMVEPENAEFIVTD 
(SEQ ID NO:735) 


KIAA0147


1469875


2


rLKUKnVALLAKotKbLbrolAobKoo I rY KAbUAblrVbKIAtbbAAnKAb I 

LQVGDRVLSINGVDVTEARHDHAVSLLTAASPTIALLLEREAGG (SEQ ID 
NO:736) 


KIAA0147 


1469875 


3 


ILEGPYPVEEIRLPRAGGPLGLSIVGGSDHSSHPFGVQEPGVFISKVLPRGLAA 
RSGLRVGDRILAVNGQDVRDATHQEAVSALLRPCLELSLLVRRDPAEFIVTD 
(SEQ ID NO:737) 


KIAA0147 


1469875 


4 


RELCIQKAPGERLGISIRGGARGHAGNPRDPTDEGIFISKVSPTGAAGRDGRL 
RVGLRLLEVNQQSLLGLTHGEAVQLLRSVGDTLTVLVCDGFEASTDAALEVS (SEQ ID NO:738)


KIAA0303 


2224546 


1 


PHQPIVIHSSGKNYGFTIRAIRVYVGDSDIYTVHHIVWNVEEGSPACQAGLKAG 
DLITHINGEPVHGLVHTEVIELLLKSGNKVSITTTPF (SEQ ID NO:739) 


KIAA0313 


7657260 


1 


ILACAAKAKRRLMTLTKPSREAPLPFILLGGSEKGFGIFVDSVDSGSKATEAGL 
KRGDQILEVNGQNFENIQLSKAMEILRNNTHLSITVKTNLFVFKELLTNSS (SEQ 
ID NO:740) 


KIAA0316 


6683123 


1 


IPPAPRKVEMRRDPVLGFGFVAGSEKPVWRSVTPGGPSEGKLIPGDQIVMIN 
DEPVSAAPRERVIDLVRSCKESILLTVIQPYPSPK (SEQ ID NO:741) 


KIAA0340 


2224620 


1 


LNKRTTMPKDSGALLGLKWGGKMTDLGRLGAFITKVKKGSLADWGHLRAG
DEVLEWNGKPLPGATNEEVYNIILESKSEPQVEIIVSRPIGDIPRIHRD (SEQ ID 
NO:742) 
KIAA0380


2224700 




QRCVIIQKDQHGFGFTVSGDRIVLVQSVRPGGAAMKAGVKEGDRIIKVNGTMV 
TNSSHLEWKLIKSGAYVALTLLGSS (SEQ ID NO:743) 


KIAA0382 


7662087 


1 


ILVQRCVIIQKDDNGFGLTVSGDNPVFVQSVKEDGAAMRAGVQTGDRIIKVNG 
TLVTHSNHLEWKLIKSGSYVALTVQGRPPGNSS (SEQ ID NO:744) 


KIAA0440 


2662160 


1 


SVEMTLRRNGLGQLGFHVNYEGIVADVEPYGYAWQAGLRQGSRLVEICKVAV 
ATLSHEQMIDLLRTSVTVKWIIPPHD (SEQ ID NO:745) 


KIAA054O 


A /i7COQcn 
14/OZOOU 




Lf\VM 1 ooWb 1 VUM 1 LKKINbLoULbrnVNYUo 1 V At V t U T u rM WUMo LKUo 

SRLVEICKVAWTLTHDQMIDLLRTSVTVKWIIPPFEDGTPRRGW (SEQ ID 
NO:746) 


KIAA0559 


3043641 


1 


HYIFPHARIKITRDSKDHTVSGNGLGIRIVGGKEIPGHSGEIGAYIAKILPGGSAE 
QTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGEAEICVRLDLNML (SEQ ID NO:747)


KIAA0561 


3043645 


1 


LCGSLRPPIVIHSSGKKYGFSLRAIRVYMGDSDVYTVHHWWSVEDGSPAQEA
GLRAGDLITHINGESVLGLVHMDWELLLKSGNKISLRTTALENTSIKVG (SEQ
ID NO:748)


KIAA0613 


3327039 


1 


SYSVTLTGPGPWGFRLQGGKDFNMPLTISRITPGSKAAQSQLSQGDLWAIDG 
VNTDTMTHLEAQNKIKSASYNLSLTLQKSKNSS (SEQ ID NO:749)


KIAA0751 


12734165 


1 


ISRDSGAMLGLKWGGKMTESGRLCAFITKVKKGSLADTVGHLRPGDEVLEW 
NGRLLQGATFEEVYNIILESKPEPQVELWSRPIAIHRD (SEQ ID NO:750) 


KIAA0807 


3882334 


1 


ISALGSMRPPIIIHRAGKKYGFTLRAIRVYMGDSDVYTVHHMVWHVEDGGPAS 
EAGLRQGDLITHVNGEPVHGLVHTEWELILKSGNKVAISTTPLENSS (SEQ ID 
NO:751) 


KIAA0858 


4240204 


1


FSDMRISINQTPGKSLDFGFTIKWDIPGIFVASVEAGSPAEFSQLQVDDEIIAINN 
TKFSYNDSKEWEEAMAKAQETGHLVMDVRRYGKAGSPE (SEQ ID NO:752)


KIAA0902 


4240292 


1 


QSAHLEVIQLANIKPSEGLGMYIKSTYDGLHVITGTTENSPADRCKKIHAGDEVI 
QVNHQTWGWQLKNLVNALREDPSGVILTLKKRPQSMLTSAPA (SEQ ID
NO:753) 


KIAA0967 


4589577 


1 


ILTQTLIPVRHTVKIDKDTLLQDYGFHISESLPLTWAVTAGGSAHGKLFPGDQIL 
QMNNEPAEDLSWERAVDILREAEDSLSITWRCTSGVPKSSNSS (SEQ ID 
NO:754) 


KIAA0973 


4589589 


1 


GLRSPITIQRSGKKYGFTLRAIRVYMGDTDVYSVHHIVWHVEEGGPAQEAGLC 
AGDLITHVNGEPVHGMVHPEWELILKSGNKVAVTTTPFE (SEQ ID NO:755) 


KIAA1095 


5889526 


1 


QGEETKSLTLVLHRDSGSLGFNIIGGRPSVDNHDGSSSEGIFVSKIVDSGPAAK 
EGGLQIHDRIIEVNGRDLSRATHDQAVEAFKTAKEPIWQVLRRTPRTKMFTP 
(SEQ ID NO:756) 


KIAA1095 


5889526 




QEMDREELELEEVDLYRMNSQDKLGLTVCYRTDDEDDIGIYISEIDPNSIAAKD 
GRIREGDRIIQINGIEVQNREEAVALLTSEENKNFSLLIARPELQLD (SEQ ID 
NO:757) 


KIAA1202 


6330421 


1 


RSFQYVPVQLQGGAPWGFTLKGGLEHCEPLTVSKIEDGGKAALSQKMRTGD 
ELVNINGTPLYGSRQEALILIKGSFRILKLIVRRRNAPVS (SEQ ID NO:758) 


KIAA1222 


6330610 


1 


ILEKLELFPVELEKDEDGLGISIIGMGVGADAGLEKLGIFVKTVTEGGAAQRDGR 
IQVNDQIVEVDGISLVGVTQNFAATVLRNTKGNVRFVIGREKPGQVS (SEQ ID NO:759)


KIAA1284 


6331369 


1 


KDVNNAVNPKKLTVIKAKEQLKLLEVLVGIIHQTKWSWRRTGKQGDGERLVVH 
GLLPGGSAMKSGQVLIGDVLVAVNDVDVTTENIERVLSCIPGPMQVKLTFENA 
vn\/wDCT /ccn in MfY7ftn\ 

YUVftKtl (SEQ ID NO:760)


KIAA1389 


7243158 


1 


TRGCETVEMTLRRNGLGQLGFHVNFEGIVADVEPFGFAWKAGLRQGSRLVEI 
CKVAVATLTHEQMIDLLRTSVTVKWIIQPHDDGSPRR (SEQ ID NO:761) 


KIAA1415 


7243210 


1 


VENILAKRLLILPQEEDYGFDIEEKNKAVWKSVQRGSLAEVAGLQVGRKIYSIN 
EDLVFLRPFSEVESILNQSFCSRRPLRLLVATKAKEIIKIP (SEQ ID NO:762) 


KIAA1526 


5817166 


1 


PDSAGPGEVRLVSLRRAKAHEGLGFSIRGGSEHGVGIYVSLVEPGSLAEKEGL 
RVGDQILRVNDKSLARVTHAEAVKALKGSKKLVLSVYSAGRIPGGYVTNH 
(SEQ ID NO:763) 


KIAA1526 


5817166 


2 


LQGGDEKKVNLVLGDGRSLGLTIRGGAEYGLGIYITGVDPGSEAEGSGLKVGD
QILEVNWRSFLNILHDEAVRLLKSSRHLILTVKDVGRLPHARTTVDE (SEQ ID 
NO:764) 


KIAA1526 


5817166 


3 


WTSGAHVHSGPCEEKCGHPGHRQPLPRIVTIQRGGSAHNCGQLKVGHVILEV 
NGLTLRGKEHREAARIIAEAFKTKDRDYIDFLDSL (SEQ ID NO:765) 


KIAA1620


10047316




ELRRAFL VFIIVFTFAQTGVSGINVAGGGKEGIFVRELREDSPAARSLSLQEGD 
QLLSARVFFENFKYEDALRLLQCAEPYKVSFCLKRTVPTGDLALRP (SEQ ID 
NO:766) 


KIAA1634 


10047344 


1 


PSQLKGVLVRASLKKSTMGFGFTIIGGDRPDEFLQVKNVLKDGPAAQDGKIAP 
GDVIVDINGNCVLGHTHADWQMFQLVPVNQYVNLTLCRGYPLPDDSED 
(SEQ ID NO:767) 


KIAA1634 


10047344 


2 


ASSGSSQPELVTIPLIKGPKGFGFAIADSPTGQKVKMILDSQWCQGLQKGDIIK 
EIYHQNVQNLTHLQWEVLKQFPVGADVPLLILRGGPPSPTKTAKM (SEQ ID 
NO:768) 
KIAA1634


10047344 


3 


LYEDKPPLTNTFLISNPRTTADPRILYEDKPPNTKDLDVFLRKQESGFGFRVLG 
GDGPDQSIYIGAIIPLGAAEKDGRLRAADELMCIDGIPVKGKSHKQVLDLMTTA 
ARNGHVLLTVRRKIFYGEKQPEDDSGSPGIHRELT (SEQ ID NO:769) 


KIAA1634 


10047344 


4 


PAPQEPYDWLQRKENEGFGFVILTSKNKPPPGVIPHKIGRVIEGSPADRCGKL 
KVGDHISAVNGQSIVELSHDNIVQLIKDAGVTVTLTVIAEEEHHGPPS (SEQ ID
NO:770)


KIAA1634 


10047344 


5 


QNLGCYPVELERGPRGFGFSLRGGKEYNMGLFILRLAEDGPAIKDGRIHVGD 
QIVEINGEPTQGITHTRAIELIQAGGNKVLLLLRPGTGLIPDHGLA (SEQ ID 
NO:771) 


KIAA1719 


1267982 


0 


ITWELIKKEGSTLGLTISGGTDKDGKPRVSNLRPGGLAARSDLLNIGDYIRSVN 
GIHLTRLRHDENTLLKNVGERWLEVEY (SEQ ID NO:772) 


KIAA1719 


1267982 


1 


ILDVSLYKEGNSFGFVLRGGAHEDGHKSRPLVLTYVRPGGPADREGSLKVGD 
RLLSVDGIPLHGASHATALATLRQCSHEALFQVEYDVATP (SEQ ID NO:773)


KIAA1719 


1267982 


2 


In 1 VANAoorLMVblVK. 1 rljoALAjloL 1 1 1 oLKNf\ovl 1 lUKIKrAo V VUKooALH 
PGDHILSIDGTSMEHCSLLEATKLLASISEKVRLEILPVPQSQRPL (SEQ ID 
NO:774) 


KIAA1719 


1267982 


3 


IQIVHTETTEWLCGDPLSGFGLQLQGGIFATETLSSPPLVCFIEPDSPAERCGL 
LQVGDRVLSINGIATEDGTMEEANQLLRDAALAHKWLEVEFDVAESV (SEQ 
ID NO:775) 


KIAA1719 


1267982 


4 


IQFDVAESVIPSSGTFHVKLPKKRSVELGITISSASRKRGEPLIISDIKKGSVAHR 
TGTLEPGDKLLAIDNIRLDNCPMEDAVQILRQCEDLVKLKIRKDEDN (SEQ ID 
NO:776)


KIAA1719 


1267982 


5 


IQTTGAVSYTVELKRYGGPLGITISGTEEPFDPIVISGLTKRGLAERTGAIHVGD 
RILAINNVSLKGRPLSEAIHLLQVAGETVTLKIKKQLDR (SEQ ID NO:777) 


KIAA1719 


1267982 


6 


ILEMEELLLPTPLEMHKVTLHKDPMRHDFGFSVSDGLLEKGVYVHTVRPDGPA 
HRGGLQPFDRVLQVNHVRTRDFDCCLAVPLLAEAGDVLELIISRKPHTAHSS 
(SEQ ID NO:778) 


LIM Mystique 


12734250 


1 


MALTVDVAGPAPWGFRITGGRDFHTPIMVTKVAERGKAKDADLRPGDIIVAIN 
GESAEGMLHAEAQSKIRQSPSPLRLQLDRSQATSPGQT (SEQ ID NO:779) 


LIM Protein 


3108092 


1 


SNYSVSLVGPAPWGFRLQGGKDFNMPLTISSLKDGGKAAQANVRIGDWLSID 
GINAQGMTHLEAQNKIKGCTGSLNMTLQRAS (SEQ ID NO:780) 


LIMK1 


4587498 


1 


TLVEHSKLYCGHCYYQTWTPVIEQILPDSPGSHLPHTVTLVSIPASSHGKRGL 
SVSIDPPHGPPGCGTEHSHTVRVQGVDPGCMSPDVKNSIHVGDRILEINGTPI 
RIMVPL DFIDl 1 IQFTSRL LQLTLFHD (SEQ ID NO:781)


LIMK2 


1805593 


1 


PYSVTLISMPATTEGRRGFSVSVESACSNYATTVQVKEVNRMHISPNNRNAIH 
PGDRILEINGTPVRTLRVEEVEDAISQTSQTLQLLIEHD (SEQ ID NO:782) 


LIM-RIL 


1085021 


1 


IHSVTLRGPSPWGFRLVGRDFSAPLTISRVHAGSKASLAALCPGDLIQAINGES 
TELMTHLEAQNRIKGCHDHLTLSVSRPE (SEQ ID NO:783) 


LU-1 


U52111 


1 


VCYRTDDEEDLGIYVGEVNPNSIAAKDGRIREGDRIIQINGVDVQNREEAVAILS 
QEENTNISLLVARPESQLA (SEQ ID NO:784)


MAGI1


3370997 


1 


IQKKNHWTSRVHECTVKRGPQGELGVTVLGGAEHGEFPYVGAVAAVEAAGL 
PGGGEGPRLGEGELLLEVQGVRVSGLPRYDVLGVIDSCKEAVTFKAVRQGGR 
(SEQ ID NO:785) 


MAGI1


3370997 


2 


PSELKGKFIHTKLRKSSRGFGFTWGGDEPDEFLQIKSLVLDGPAALDGKMET 
GDVIVSVNDTCVLGHTHAQWKIFQSIPIGASVDLELCRGYPLPFDPDDPN 
(SEQ ID NO:786) 


MAGI1


3370997 


3 


PATQPELITVHIVKGPMGFGFTIADSPGGGGQRVKQIVDSPRCRGLKEGDLIVE 
VNKKNVQALTHNQWDMLVECPKGSEVTLLVQRGGNLS (SEQ ID NO:787) 


MAGI1


3370997 


4 


PDYQEQDIFLWRKETGFGFRILGGNEPGEPIYIGHIVPLGAADTDGRLRSGDEL 
ICVDGTPVIGKSHQLWQLMQQAAKQGHVNLTVRRKWFAVPKTENSS (SEQ 
ID NO:788) 


MAGI1


3370997 


5 


GWSTWQPYDVEIRRGENEGFGFVIVSSVSRPEAGTTFAGNACVAMPHKIGR 
IIEGSPADRCGKLKVGDRILAVNGCSITNKSHSDIVNLIKEAGNTVTLRIIPGDES 

oNA (SEQ ID NO:789)


MAGI1


3370997 


6 


QATQEQDFYTVELERGAKGFGFSLRGGREYNMDLYVLRLAEDGPAERCGKM 
RIGDEILEINGETTKNMKHSRAIELIKNGGRRVRLFLKRG (SEQ ID NO:790) 


MGC5395 


BC012477 


1 


PAKMEKEETTRELLLPNWQGSGSHGLTIAQRDDGVFVQEVTQNSPAARTGW 
KEGDQIVGATIYFDNLQSGEVTQLLNTMGHHTVGLKLHRKGDRSPNSS (SEQ
ID NO:791)


MINT1 


2625024 


1 


SENCKDVFIEKQKGEILGWIVESGWGSILPTVIIANMMHGGPAEKSGKLNIGDQ
IMSINGTSLVGLPLSTCQSIIKGLKNQSRVKLNIVRCPPVNSS (SEQ ID NO:792)


MINT1 


2625024 


2 


LRCPPVTTVLIRRPDLRYQLGFSVQNGIICSLMRGGIAERGGVRVGHRIIEINGQ 
SWATPHEKIVHILSNAVGEIHMKTMPAAMYRLLNSS (SEQ ID NO:793)


MINT3 


3169808 


1 


LSNSDNCREVHLEKRRGEGLGVALVESGWGSLLPTAVIANLLHGGPAERSGA 
LSIGDRLTAINGTSLVGLPLAACQAAVRETKSQTSVTLSIVHCPPVTTAIM (SEQ 
ID NO:794) 
MINT3


3169808 


2 


LVHCPPVTTAIIHRPHAREQLGFCVEDGIICSLLRGGIAERGGIRVGHRIIEINGQ 
SWATPHARIIELLTEAYGEVHIKTMPAATYRLLTG (SEQ ID NO:795) 


MPP1 


189785 


1 


RKVRLIQFEKVTEEPMGITLKLNEKQSCTVARILHGGMIHRQGSLHVGDEILEIN 
GTNVTNHSVDQLQKAMKETKGMISLKVIPNQ (SEQ ID NO:796) 


MPP2 


939884 


1 


PVPPDAVRMVGIRKTAGEHLGVTFRVEGGELVIARILHGGMVAQQGLLHVGDII 
KEVNGQPVGSDPRALQELLRNASGSVILKILPNYQ (SEQ ID NO:797) 


MUPP1


2104784


1


piodi_i\/p\/ppi i i/DDcrn ncc\A/n pcpkip^pi ^iP\/OFiPiFfiQ\/AuiprifiPi 
KETDQILAINGQALDQTITHQQAISILQKAKDTVQLVIARGSLPQLV (SEQ ID 
NO:798) 


MUPP1 


2104784 


2 


PVHWQHMETIELVNDGSGLGFGIIGGKATGVIVKTILPGGVADQHGRLCSGDHI 
LKIGDTDLAGMSSEQVAQVLRQCGNRVKLMIARGAIEERTAPT (SEQ ID 
NO:799) 


MUPP1 


2104784 


3 


QESETFDVELTKNVQGLGITIAGYIGDKKLEPSGIFVKSITKSSAVEHDGRIQIGD 
QIIAVDGTNLQGFTNQQAVEVLRHTGQTVLLTLMRRGMKQEA (SEQ ID
NO:800)


MUPP1 


2104784 


4 


LNYEIWAHVSKFSENSGLGISLEATVGHHFIRSVLPEGPVGHSGKLFSGDELL 
EVNGITLLGENHQDWNILKELPIEVTMVCCRRTVPPT (SEQ ID NO:801) 


MUPP1 


2104784 


5 


WEAGIQHIELEKGSKGLGFSILDYQDPIDPASTVIIIRSLVPGGIAEKDGRLLPGD 
RLMFVNDVNLENSSLEEAVEALKGAPSGTVRIGVAKPLPLSPEE (SEQ ID
NO:802)


MUPP1 


2104784 


6 


RNVSKESFERTINIAKGNSSLGMTVSANKDGLGMIVRSIIHGGAISRDGRIAIGD 
CILSINEESTISVTNAQARAMLRRHSLIGPDIKITYVPAEHLEE (SEQ ID NO:803) 


MUPP1 


2104784 


7 


LNWNQPRRVELWREPSKSLGISIVGGRGMGSRLSNGEVMRGIFIKHVLEDSP 
AGKNGTLKPGDRIVEVDGMDLRDASHEQAVEAIRKAGNPWFMVQSIINRPRK 
SPLPSLL (SEQ ID NO:804) 


MUPP1 


2104784 


8 


LTGELHMIELEKGHSGLGLSLAGNKDRSRMSVFIVGIDPNGAAGKDGRLQIAD
ELLEINGQILYGRSHQNASSIIKCAPSKVKIIFIRNKDAVNQ (SEQ ID NO:805) 


MUPP1 


2104784 


9 


LSSFKNVQHLELPKDQGGLGIAISEEDTLSGVIIKSLTEHGVAATDGRLKVGDQI 
LAVDDEIWGYPIEKFISLLKTAKMTVKLTIHAENPDSQ (SEQ ID NO:806) 


MUPP1 


2104784 


10 


LPGCETTIEISKGRTGLGLSIVGGSDTLLGAIIIHEVYEEGAACKDGRLWAGDQI 
LEVNGIDLRKATHDEAINVLRQTPQRVRLTLYRDEAPYKE (SEQ ID NO:807) 


MUPP1 


2104784 


11 


KEEEVCDTLTIELQKKPGKGLGLSIVGKRNDTGVFVSDIVKGGIADADGRLMQ 
GDQILMVNGEDVRNATQEAVAALLKCSLGTVTLEVGRIKAGPFHS (SEQ ID 
NO:808)


MUPP1 


2104784 


12 


LQGLRTVEMKKGPTDSLGISIAGGVGSPLGDVPIFIAMMHPTGVAAQTQKLRV 
GDRIVTICGTSTEGMTHTQAVNLLKNASGSIEMQWAGGDVSV (SEQ ID 
NO:809) 


MUPP1 


2104784 


13 


LGPPQCKSITLERGPDGLGFSIVGGYGSPHGDLPIYVKTVFAKGAASEDGRLK 
RGDQIIAVNGQSLEGVTHEEAVAILKRTKGTVTLMVLS (SEQ ID NO:810) 


NeDLG 


10863920 


1 


IQYEEIVLERGNSGLGFSIAGGIDNPHVPDDPGIFITKIIPGGAAAMDGRLGVND 
CVLRVNEVEVSEWHSRAVEALKEAGPWRLWRRRQN (SEQ ID NO:811)


NeDLG 


10863920 


2 


ITLLKGPKGLGFSIAGGIGNQHIPGDNSIYITKIIEGGAAQKDGRLQIGDRLLAVN 
NTNLQDVRHEEAVASLKNTSDMVYLKVAKPGSLE (SEQ ID NO:812) 


NeDLG 


10863920 


3 


ILLHKGSTGLGFNIVGGEDGEGIFVSFILAGGPADLSGELRRGDRILSVNGVNL 
RNATHEQAAAALKRAGQSVTIVAQYRPEEYSRFESKIHDLREQMMNSSMSSG 
SGSLRTSEKRSLE (SEQ ID NO:813) 


Neurabin II 


AJ401189 


1 


CVERLELFPVELEKDSEGLGISIIGMGAGADMGLEKLGIFVKTVTEGGAAHRDG 
RIQVNDLLVEVDGTSLVGVTQSFAASVLRNTKGRVRFMIGRERPGEQSEVAQ 
RIHRD (SEQ ID NO:814) 


NOS1


642525 


1 


IQPNVISVRLFKRKVGGLGFLVKERVSKPPVIISDLIRGGAAEQSGLIQAGDIILA 
VNGRPLVDLSYDSALEVLRGIASETHWLILRGP (SEQ ID NO:815) 


novel PDZ gene 


7000-1 77 


4 
1 


OAKIQnPQnilNQ\/P\/Pl^QPAf^PI f^ECV/p/^fiQFU/'SI f^lFV/QkA/FFf^QAFR Af^l 
UnlNoULoUllnoVlAVC^rnoKLorOVnO 

CVGDKITEVNGLSLESTTMGSAVKVLTSSSRLHMMVRRMGRVPGIKFSKEKN 
SS (SEQ ID NO:816) 


novel PDZ gene 


700JH 77 
( ZZol ( f 


z 


KoU I OOtUoVr\r\lVnLT I 1 OUUrULorlNIKoo^ruLUI T VO^VU^o^JLnCC^^I 

GIKVGDQVLAANGVRFDDISHSQAVEVLKGQTHIMLTIKETGRYPAYKEMNSS 
(SEQ ID NO:817) 


Novel Serine
Protease






KIKKFI TF^HDROAKGKAITKKKYIGIRMM^l T^KAKFLKDRHRDFPDVISGA 
YIIEVIPDTPAEAGGLKENDVIISINGQSWSANDVSDVIKRESTLNMWRRGNE 
DIMITV (SEQ ID NO:818)


Numb Binding 
Protein 


AK056823 


1 


PDGEITSIKINRVDPSESLSIRLVGGSETPLVHIIIQHIYRDGVIARDGRLLPGDIIL 
KVNGMDISNVPHNYAVRLLRQPCQVLWLTVMREQKFRSRNSS (SEQ ID 
NO:819) 


Numb Binding 
Protein 


AK056823 


2 


HRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHGQLEE 
NDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPENSS (SEQ 
ID NO:820) 
Numb Binding
Protein 


AK056823


3


rill UnbrW VNIUKUroboLolvl 1 VAooAonKtWULr 1 T Vlo VCrou vlor\Uor\l 

KTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEFIV (SEQ 
ID NO:821) 


Numb Binding 
Protein 


AK056823 


4 


PRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDGRI 
RCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL (SEQ ID NO:822)


Outer Membrane 


7023825 


1 


LLTEEEINLTRGPSGLGFNIVGGTDQQYVSNDSGIYVSRIKENGAAALDGRLQE 
GDKILSVNGQDLKNLLHQDAVDLFRNAGYAVSLRVQHRLQVQNGIHS (SEQ 
ID NO:823) 


p55T 


12733367 


1 


PVDAIRILGIHKRAGEPLGVTFRVENNDLVIARILHGGMIDRQGLLHVGDIIKEVN 
GHEVGNNPKELQELLKNISGSVTLKILPSYRDTITPQQ (SEQ ID NO:824) 


PAR3 


8037914 


1 


DDMVKLVEVPNDGGPLGIHWPFSARGGRTLGLLVKRLEKGGKAEHENLFRE 
NDCIVRINDGDLRNRRFEQAQHMFRQAMRTPIIWFHWPAA (SEQ ID NO:825) 


PAR3 


8037914 


2 


GKRLNIQLKKGTEGLGFSITSRDVTIGGSAPIYVKNILPRGAAIQDGRLKAGDRL
IEVNGVDLVGKSQEEWSLLRSTKMEGTVSLLVFRQEDA (SEQ ID NO:826)


PAR3 


8037914 


3 


TPDGTKbrLTrtVrLNDbobACjLoVoVK^ 

KDGRLRVNDQLIAVNGESLLGKTNQDAMETLRRSMSTEGNKRGMIQLIVA 
(SEQ ID NO:827) 


PAR6 


2613011


1 


LPbTHRRVRLnKHGoUKPLGrYIKULiMo VKVArUoLbKVKolrloKLVKljoLA 
ESTGLLAVSDEILEVNGIEVAGKTLDQVTDMMVANSHNLIVTVKPANQR (SEQ 
ID NO:828) 


PAR6 GAMMA 


13537118


1 


IDVDLVPb 1 nKRVKLHRrKjL/cKrLCpr YIKUoAoVKV 1 rnULbfWrolrloKIVIVr 
GGLAESTGLLAVNDEVLEVNGIEVAGKTLDQVTDMMIANSHNLIVTVKPANQR
NNW (SEQ ID NO:829) 


PDZ-73 


5031978 


1 


KoKKLKbVKLUKLnHboLoLo vKooLtrbOoLrloML^buUMUoVoLUVoUC 
IVRINGYSISSCTHEEVINLIRTKKTVSIKVRHIGLIPVKSSPDEFH (SEQ ID 
NO:830) 


PDZ-73 


5031978




ID^MDCMLf CLf LA/CIQI V/rCDPI r'^CICQf^Dirik'DniPIQI-IVKPfiQI QAFV/f^l Plf^ 

lruNKbNI\bl\l\VNoLVooKoLouolooori 

DQIVEVNGVDFSNLDHKEAVNVLKSSRSLTISIVAAAGRELFMTDEF (SEQ ID 
NO:831) 


PDZ-73 


5031978 


3 


PEQIMGKDVRLLRIKKEGSLDLALEGGVDSPIGKVWSAVYERGAAERHGGIV 
KGDEIMAINGKIVTDYTLAEADAALQKAWNQGGDWIDLWAVCPPKEYDD 
(SEQ ID NO:832) 


PDZK1 


2944188 


1 


LTSTFNPRECKLSKQEGQNYGFFLRIEKDTEGHLVRWEKCSPAEKAGLQDG 
DRVLRINGVFVDKEEHMQWDLVRKSGNSVTLLVLDGDSYEKAGSPGIHRD 
(SEQ ID NO:833)


PDZK1 


2944188 


2 


RLCYLVKEGGSYGFSLKTVQGKKGVYMTDITPQGVAMRAGVLADDHLIEVNG 
ENVEDASHEEWEKVKKSGSRVMFLLVDKETDKREFIVTD (SEQ ID NO:834) 


PDZK1 


2944188 


3 


QFKRETASLKLLPHQPRIVEMKKGSNGYGFYLRAGSEQKGQIIKDIDSGSPAE 
EAGLKNNDLWAVNGESVETLDHDSNA/EMIRKGGDQTSLLWDKETDNMYRL 
AbNV I U (SEQ ID NO:835)


PDZK1 


2944188 


4 


PDTTEEVDHKPKLCRLAKGENGYGFHLNAIRGLPGSFIKEVQKGGPADLAGLE 
DEDVIIEVNGVNVLDEPYEKWDRIQSSGKNVTLLVZGKNSS (SEQ ID NO:836) 


PICK1 


4678411 


1 


PTVPGKVTLQKDAQNLIGISIGGGAQYCPCLYIVQVFDNTPAALDGTVAAGDEI
TGVNGRSIKGKTKVEVAKMIQEVKGEVTIHYNKLQ (SEQ ID NO:837)


PIST 


98374330 


1 


SQGVGPIRKVLLLKEDHEGLGISITGGKEHGVPILISEIHPGQPADRCGGLHVG 
DAILAVNGVNLRDTKHKEAVTILSQQRGEIEFEWYVAPEVDSD (SEQ ID 
NO:838) 


prIL16


1478492 


1 


IHVTILHKEEGAGLGFSLAGGADLENKVITVHRVFPNGLASQEGTIQKGNEVLSI
NGKSLKGTTHHDALAILRQAREPRQAVIVTRKLTPEEFIVTD (SEQ ID NO:839)


prIL16


1478492 


2 


TAEATVCTvTLbKMSAoLCjroLbUCaKooLnoU I INKIhKoAAobUob I VU 
PGDEILQLGGTAMQGLTRFEAWNIIKALPDGPVTIVIRRKSLQSK (SEQ ID 
NO:840) 


PSD95 


3318652 


1 


LEYEEITLERGNSGLGFSIAGGTDNPHIGDDPSIFITKIIPGGAAAQDGRLRVND
SILFVNEVDVREVTHSAAVEALKEAGSIVRLYVMRRKPPAENSS (SEQ ID
NO:841)


PSD95 


3318652 


2 


HVMRRKPPAEKVMEIKLIKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAA 
HKDGRLQIGDKILAVNSVGLEDVMHEDAVAALKNTYDWYLKVAKPSNAYL 
(SEQ ID NO:842) 


PSD95 


3318652 


3 


REDIPREPRRIVIHRGSTGLGFNIVGGEDGEGIFISFILAGGPADLSGELRKGDQ 
ILSVNGVDLRNASHEQAAIALKNAGQTVTIIAQYKPEFIVTD (SEQ ID NO:843) 


PTN-3 


179912 


1 


LIRITPDEDGKFGFNLKGGVDQKMPLWSRINPESPADTCIPKLNEGDQIVLING 
RDISEHTHDQWMFIKASRESHSRELALVIRRR (SEQ ID NO:844) 


PTN-4 


190747 


1 


IRMKPDENGRFGFNVKGGYDQKMPVIVSRVAPGTPADLCVPRLNEGDQWLI 
NGRDIAEHTHDQWLFIKASCERHSGELMLLVRPNA (SEQ ID NO:845)


PTPL1 


515030 


1 


PEREITLVNLKKDAKYGLGFQIIGGEKMGRLDLGIFISSVAPGGPADFHGCLKP 
GDRLISVNSVSLEGVSHHAAIEILQNAPEDVTLVISQPKEKISKVPSTPVHL 
(SEQ ID NO:846) 
PTPL1


515030 


2 


GDIFcVELJVKNDNbLGIbVl GGVNTbVKHGGIYVIvW^ 
VLAVNGVSLEGATHKQAVETLRNTGQWHLLLEKGQSPTSK (SEQ ID 
NO:847) 


PTPL1 


515030 


3 


TEENTFEVKLFKNSSGLGFSFSREDNLIPEQINASIVRVKKLFAGQPAAESGKID 
VGDVILKVNGASLKGLSQQEVISALRGTAPEVFLLLCRPPPGVLPEIDT (SEQ 
ID NO:848) 


PTPL1 


515030 


4 


ELEVELLITLIKSEKASLGFTVTKGNQRIGCYVHDVIQDPAKSDGRLKPGDRLIK 
VNDTDVTNMTHTDAVNLLRAASKTVRLVIGRVLELPRIPMLPH (SEQ ID 
NO:849) 


PTPL1 


515030 


5 


MLPHLLPDITLTCNKEELGFSLCGGHDSLYQWYISDINPRSVAAIEGNLQLLDV 
IHYVNGVSTQGMTLEEVNRALDMSLPSLVLKATRNDLPV (SEQ ID NO:850)


RGS12 


3290015 




R PSP P R VRS VE VARG RAG YG FT Lb G QAPG VLb G V M KG b PAD F VG LKAG U U 1 
LAVNEINVKKASHEDWKLIGKCSGVLHMVIAEGVGRFESCS (SEQ ID 
NO:851) 


RGS3 


18644735




LCSERRYRQITIPRGKDGFGFT ICCDbPVRVUAVDbGGPAbKAGLUULD 1 VLU 
LNERPVEHWKCVELAHEIRSCPSEIILLVWRMVPQVKPGIHRD (SEQ ID
NO:852) 


Rhophilin-like 


14279408




iprcAkii/DiArrDnnciDCTACcAni /tti d/^ki a d\//"4\/ljci r\DVr k OAC\/ArADCP 
ISFSANKKVvrPPRbiRFTAttGDLGFTLKGNAPvUVnrLIJrY 

DYIVSIQLVDCKWLTLSEVMKLLKSFGEDEIEMKWSLLDSTSSMHNKSAT 

(SEQ ID NO:853) 


Serine Protease 


2738914 




RGEKKNSSSGISGbQRRYIGVMMLTLorblLAcLULRtrbrrDVUnGVLIn 
LGSPAHRAGLRPGDVILAIGEQMVQNAEDVYEAVRTQSQLAVQIRRGRETLTL
YV (SEQ ID NO:854) 


Shank 1 


6049185 


1 


EEKTWLQKKDNEGFGFVLRGAKADTPIEEFTPTPAFPALQYLESVDEGGVAW 
RAGLRTGDFLIEVNNENWKVGHRQWNMIRQGGNHLVLKWTVTRNLDPDD
TAKKKA (SEQ ID NO:855)


Shank 3 


* 




SDYVIDDKVAVLQKRDHEGFGFVLRGAKAETPIEEFTPTPAFPALQYLESVDVE 
GVAWRAGLRTGDFLIEVNGVNWKVGHKQWALIRQGGNRLVMKWSVTRKP 
EEDG (SEQ ID NO:856) 


Shroom 


18652858 


1 


IYLEAFLEGGAPWGFTLKGGLEHGEPLMSKVEEGGKADTLSSKLQAGDEWHI 
NbVTLSSSRKEAVSLVKGSYKTLRLWRRDVG 1 DPGH (SEQ ID NO:857)


SIP1 


2047327 


1 


IRLCRLVRGEQGYGFHLHGEKGRRGQFIRRVEPGSPAEAAALRAGDRLVEVN 
GVNVbGETHHQWQRIKAVbGQTRLLWDUN (SEQ ID NO:858)


SIP1 


2047327 




IRHLRKGPQGYGFNLHSDKSRPGQYIRSVDPGSPAARSGLRAQDRLIEVNGQ 
NVEGLRHAEWASIKAREDEARLLWDPETDE (SEQ ID NO:859)


SITAC-18 


8886071 




PGVREIHLCKDERGKTGLRLRKVDQGLFVQLVQANTPASLVGLRFGDQLLQID 
GRDCAGWSSHKAHQWKKASGDKIWWRDRPFQRTVTM (SEQ ID NO:860) 


SITAC-18 


8886071 




PFQRTVTMHKDSMGHVGFVIKKGKIVSLVKGSSAARNGLLTNHYVCEVDGQN 
VIGLKDKKIMEILATAGNWTLTIIPSVIYEHIVEFIV (SEQ ID NO:861) 


SSTRIP 


7025450 




LKEKTVLLQKKDSEGFGFVLRGAKAQTPIEEFTPTPAFPALQYLESVDEGGVA 
WRAGLRMGDFLIEVNGQNWKVGHRQVVNMIRQGGNTLMVKWMVTRHPD 
MDEAVQ (SEQ ID NO:862) 


SYNTENIN 


2795862 


1 


LEIKQGIREVILCKDQDGKIGLRLKSIDNGIFVQLVQANSPASLVGLRFGDQVLQ
INGENCAGWSSDKAHKVLKQAFGEKITMRIHRD (SEQ ID NO:863)


SYNTENIN 


2795862 




RDRPFERTITMHKDSTGHVGFIFKNGKITSIVKDSSAARNGLLTEHNICEINGQN 
VIGLKDSQIADILSTSGNSS (SEQ ID NO:864) 


Syntrophin 1 
alpha 


1145727 


1 


QRRRVTVRKADAGGLGISIKGGRENKMPILISKIFKGLAADQTEALFVGDAILSV 
NGEDLSSATHDEAVQVLKKTGKEWLEVKYMKDVSPYFK (SEQ ID NO:865) 


Syntrophin beta 
2 


476700 


1 


IRWKQEAGGLGISIKGGRENRMPILISKIFPGLAADQSRALRLGDAILSVNGTD 
LRQATHDQAVQALKRAGKEVLLEVKFIREFIVTD (SEQ ID NO:866) 


Syntrophin 
gamma 1 


9507162 


1 


EPFYSGERTVTIRRQTVGGFGLSIKGGAEHNIPVWSKISKEQRAELSGLLFIGD 
AILQINGINVRKCRHEEWQVLRNAGEEVTLTVSFLKRAPAFLKLP (SEQ ID
NO:867)


Syntrophin 
gamma 2 


9507164 


1 


SHQGRNRRTVTLRRQPVGGLGLSIKGGSEHNVPWISKIFEDQAADQTGMLFV 
GDAVLQVNGIHVENATHEEWHLLRNAGDEVTITVEYLREAPAFLK (SEQ ID 
NO:868)


TAX2-like 
protein 


3253116 


1 


RGETKEVEVTKTEDALGLTITDNGAGYAFIKRIKEGSIINRIEAVCVGDSIEAIND 
HSIVGCRHYEVAKMLRELPKSQPFTLRLVQPKRAF (SEQ ID NO:869) 


TIAM1 


4507500 


1 


HSIHIEKSDTAADTYGFSLSSVEEDGIRRLYVNSVKETGLASKKGLKAGDEILEI 
NNRAADALNSSMLKDFLSQPSLGLLVRTYPELE (SEQ ID NO:870) 


TIAM2 


6912703 




PLNVYDVQLTKTGSVCDFGFAVTAQVDERQHLSRIFISDVLPDGLAYGEGLRK 
GNEIMTLNGEAVSDLDLKQMEALFSEKSVGLTLIARPPDTKATL (SEQ ID 
NO:871) 


TIP1 


2613001 




QRVEIHKLRQGENLILGFSIGGGIDQDPSQNPFSEDKTDKGIYVTRVSEGGPAE 
IAGLQIGDKIMQVNGWDMTMVTHDQARKRLTKRSEEWRLLVTRQSLQK 
(SEQ ID NO:872) 


TIP2 


2613003 




RKEVEVFKSEDALGLTITDNGAGYAFIKRIKEGSVIDHIHLISVGDMIEAINGQSL 
LGCRHYEVARLLKELPRGRTFTLKLTEPRK (SEQ ID NO:873)
TIP33


2613007 


1 


HSHPRWELPKTDEGLGFNVMGGKEQNSPIYISRIIPGGVAERHGGLKRGDQL 
LSVNGVSVEGEHHEKAVELLKAAKDSVKLWRYTPKVL (SEQ ID NO:874) 


TIP43 


2613011 


1 


ISNQKRGVKVLKQELGGLGISIKGGKENKMPILISKIFKGLAADQTQALYVGDAI 
LSVNGADLRDATHDEAVQALKRAGKEVLLEVKYMREATPYV (SEQ ID 
NO:875) 


X-11 beta 


3005559 


1 


IHFSNSENCKELQLEKHKGEILGNAAA/ESGWGSILPTVILANMMNGGPAARSG 
KLSIGDQIMSINGTSLVGLPLATCQGIIKGLKNQTQVKLNIVSCPPVTTVLIKRNS 
S (SEQ ID NO:876) 


X-11 beta 


3005559 


2 


IPPVTTVLIKRPDLKYQLGFSVQNGIICSLMRGGIAERGGVRVGHRIIEINGQSV 
VATAHEKIVQALSNSVGEIHMKTMPAAMFRLLTGQENSS (SEQ ID NO:877) 


ZO-1 


292937 


1 


IWEQHTVTLHRAPGFGFGIAISGGRDNPHFQSGETSIVISDVLKGGPAEGQLQ 
ENDRVAMVNGVSMDNVEHAFAVQQLRKSGKNAKITIRRKKKVQIPNSS (SEQ ID NO:878) 


ZO-1 


292937 


2 


ISSQPAKPTKVTLVKSRKNEEYGLRLASHIFVKEISQDSLAARDGNIQEGDWL 
KINGTVTENMSLTDAKTLIERSKGKLKMWQRDRATLLNSS (SEQ ID NO:879) 


ZO-1 


292937 


3 


IRMKLVKFRKGDSVGLRLAGGNDVGIFVAGVLEDSPAAKEGLEEGDQILRVNN 
VDFTNIIREEAVLFLLDLPKGEEVTILAQKKKDVFSN (SEQ ID NO:880) 


ZO-2 


12734763 


1 


LIWEQYTVTLQKDSKRGFGIAVSGGRDNPHFENGETSIVISDVLPGGPADGLL 
QENDRWMVNGTPMEDVLHSFAVQQLRKSGKVAAIWKRPRKV (SEQ ID 
NO:881) 


ZO-2 


12734763 


2 


RVLLMKSRANEEYGLRLGSQIFVKEMTRTGLATKDGNLHEGDIILKINGTVTEN 
MSLTDARKLIEKSRGKLQLWLRDS (SEQ ID NO:882) 


ZO-2 


12734763 


3 


HAPNTKMVRFKKGDSVGLRLAGGNDVGIFVAGIQEGTSAEQEGLQEGDQILK 
VNTQDFRGLVREDAVLYLLEIPKGEMVTILAQSRADVY (SEQ ID NO:883) 


ZO-3 


10092690 


1 


IroNo 1 IWtUn 1 A 1 LolNUrKKorulAloooKUKKoooMVVoUVV 
LQTGDHIVMVNGVSMENATSAFAIQILKTCTKMANITVKRPRRIHLPAEFIVTD 
(SEQ ID NO:884) 


ZO-3 


10092690 


2 


QDVQMKPVKSVLVKRRDSEEFGVKLGSQIFIKHITDSGLAARHRGLQEGDLIL 
QINGVSSQNLSLNDTRRLIEKSEGKLSLLVLRDRGQFLVNIPNSS (SEQ ID 
NO:885) 


ZO-3 


10092690 


3 


RGYSPDTRWRFLKGKSIGLRLAGGNDVGIFVSGVQAGSPADGQGIQEGDQIL 
QVNDVPFQNLTREEAVQFLLGLPPGEEMELVTQRKQDIFWKMVQSEFIVTD 
(SEQ ID NO:886) 


*: No GI number for this PDZ domain containing protein - it was computer cloned by J.S. using the rat Shank3 sequence against 
human genomic clone AC000036. Nucleotides 6400-6496, 6985-7109, and 7211-7400 were spliced together in silico to create a 
hypothetical human Shank3. 
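
The in silico splicing described in the footnote above can be illustrated with a minimal sketch. It assumes the nucleotide ranges are 1-based and inclusive, which the footnote does not state. The function names and the constant `SHANK3_SEGMENTS` are hypothetical, introduced only for illustration, and the genomic sequence of clone AC000036 is not reproduced here.

```python
# Illustrative sketch only: joins nucleotide ranges from a genomic sequence.
# Assumption: coordinates are 1-based and inclusive (not stated in the footnote).

# Ranges quoted in the footnote for genomic clone AC000036.
SHANK3_SEGMENTS = [(6400, 6496), (6985, 7109), (7211, 7400)]


def splice_segments(sequence, segments):
    """Return the concatenation of the given (start, end) ranges of `sequence`."""
    pieces = []
    for start, end in segments:
        if start < 1 or end < start or end > len(sequence):
            raise ValueError(f"range {start}-{end} is outside the sequence")
        # Convert 1-based inclusive coordinates to a Python slice.
        pieces.append(sequence[start - 1:end])
    return "".join(pieces)


def spliced_length(segments):
    """Total number of nucleotides covered by the ranges."""
    return sum(end - start + 1 for start, end in segments)
```

Under the inclusive-coordinate assumption, the three ranges cover 97, 125, and 190 nucleotides, for a spliced length of 412 nucleotides. Given the clone sequence as a string `genomic`, the spliced fragment would be `splice_segments(genomic, SHANK3_SEGMENTS)`.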